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“Mental Holography”: 
Stereograms Portraying Ambiguously 
Perceivable Surfaces 


By BELA JULESZ and STEPHEN C. JOHNSON 
(Manuscript received June 28, 1968) 


An algorithm has been devised that can generate the same stereogram 
for two (or more) selected surfaces. Prior to this development the only 
known ambiguous stereograms have been periodic grid patterns that could 
be perceived at various parallel depth planes. The new algorithm, however, 
can portray two. (or more) selected surfaces of general shapes and the 
observer can perceive each of these surfaces, but only one at a time. The 
technique ts an extension of random-dot stereograms and it is shown that 
in most cases adequate degrees of freedom remain for coloring the random- 
dot texture. Since these ambiguously perceivable stereograms permit the 
portrayal of both the visible and the hidden surfaces of objects, they are 
analogous to holograms. However, in the case of holograms the observer 
has to inspect them from various positions, while for ambiguous stereograms 
at 1s the mind of the observer that wanders around. 


I. INTRODUCTION 


There are many limitations in portraying our three-dimensional en- 
vironment on a two-dimensional surface. One of the primary short- 
comings is the difficulty of effectively representing the hidden surfaces 
of objects together with the visible ones. Perspective drawings and 
even stereoscopic images did not alleviate this limitation. The cubist 
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artists tried to place the hidden surfaces of their objects side by side 
with the visible ones, but the results were rather confusing. The usual 
representation by multiple projections solves the problem geometri- 
cally yet it is very difficult to combine these projections into a unified 
spatial percept. The invention of panoramagrams using lenticular 
screens by Ivest and multiple lens arrays (fly’s eye) by Lippmann? 
portrayed the three-dimensional objects within a wide angle, but in 
order to inspect some hidden surfaces the observer still has to move 
around the panoramagram. Furthermore, certain hidden surfaces (such 
as the boundaries of inside cavities of opaque objects) stayed in- 
visible. Since holograms (invented by Gabor? and improved by Leith 
and Upatnieks)* are similar to panoramagrams only more simply 
made, some change in the geometry of the optical rays has to be ini- 
tiated in order to obtain various stored organizations. This is usually 
achieved by the observer when inspecting the hologram from various 
angles. 

This article describes a method of generating ambiguously perceiv- 
able stereograms. These stereograms contain several predetermined 
surfaces, out of which only one can be perceived at a time. Moreover, 
some surfaces might be “internal,” hidden from every angle. In order 
to inspect the various surfaces the viewer can sit still; it is his mind 
that wanders around the object. 


II. AMBIGUOUS STEREOGRAMS 


Random-dot stereograms, introduced in this journal in 1960, 
have shifted interest to the problem of how the visual system resolves 
ambiguities.» Indeed, in random-dot stereograms, hundreds of dots 
are presented on a horizontal line in the left and right eye’s views, and 
the observer is confronted with ambiguities as of which of the many 
dots in the left retinal projection corresponds to a given point in the 
right retinal projection. This problem is further amplified for random- 
dot stereograms, since no monocular familiarity or depth cues are 
provided that would aid perception. Findings with random-dot stereo- 
grams have shown that the visual system selects that organization 
which yields dense surfaces, and all other possible organizations pass 
unnoticed.>* 

Would it be possible to create random-dot stereograms which por- 
tray simultaneously more than one dense surface? If possible, which 
organization would be preferentially perceived? That there are am- 
biguous stereograms is well known in psychology. Grid-like structures 
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containing vertical bars of constant periodicity (such as wallpapers 
and old-fashioned radiators) can be fused at multiple depth levels 
since the binocular disparity can be any integer multiple of the hori- 
zontal periodicity. In Ref. 7, such a periodic random-dot stereogram 
has been demonstrated which could be perceived as a plane in front 
of or behind the real plane of the printed page. These periodic random- 
dot patterns have been successfully used in studies of perception” * yet 
are limited in their scope, since only parallel planar surfaces can be 
portrayed by this method. 

This report describes a general algorithm which generates a single 
stereogram portraying two (or more) specified surfaces. That such 
stereograms can exist is based on the fact that certain areas are seen 
by one eye only and thus can be freely selected for one surface. Also 
segments in which the two (or more) surfaces coincide add to the 
degrees of freedom, since these surfaces can be covered by any random 
texture at will. In general, there is no restriction on the surfaces to be 
portrayed simultaneously. Provided that the number of surfaces is 
restricted to two and the resolution is fine (the number of samples is 
large) there is enough freedom in choosing the texture elements that 
the formation of monocularly perceivable short periodicities can be 
prevented. 

Before the general algorithm is discussed in detail, a brief illustration 
is given of the basic idea in Fig. 1. Fig. 1 (solid line shows how points P? 
and P% (belonging to surface B) have to be colored identically to P; (the 
original point of fixation, belonging to surface A), in order to obtain 
the same retinal projections for both surfaces. Fig. 1 (dotted line) shows 
how this requirement forces P? and P%{ to be colored the same. Fig. 
1 (dashed line) shows the next step P? and P{. As this procedure is con- 
tinued the algorithm assigns the color of P; to a gamut of points be- 
longing to the two surfaces. 

For a new point of fixation on either surface A or B, then there are 
two possibilities. Either this point of fixation already has been assigned 
a color by the previous algorithm or can be colored freely some brightness 
value. In the latter case the above described algorithm is continued 
which assigns this chosen color to another set of points belonging to 
the two surfaces. This procedure continues until each point on either 
surface has been colored. It is interesting to notice that the color of 
any point depends on some global relationship between surfaces A and B. 

In this algorithm, as long as only two surfaces have to be portrayed 
each, iterative step assigns one new constraint. In the case of three 
surfaces, each iterative step generates two new constraints which in 
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SURFACE A 


SURFACE B 





Fig. 1— Iterative steps in the algorithm that generates the same stereoscopic 
projections for surfaces A and B. 


turn generate 4, 8, 16, ... new constraints. This exponential prolifera- 
tion of constraints drastically reduces the degrees of freedom for three 
or more surfaces. Therefore, in the forthcoming examples we restrict 
ourselves to two surfaces to be portrayed. 

The study of how such ambiguous stereograms are perceived can 
be of considerable scientific value but also permits some useful ap- 
plications. It is now possible to portray hidden surfaces of objects 
together with the visible ones. Since in these ambiguous stereograms 
only one surface can be seen at a time, multivalued functions of two 
variables can be portrayed. For instance, if one surface is chosen to 
be the front view of an object (such as a statue or a machine part) 
and the other surface is chosen to be the rear view, one can obtain an 
entire 360 degree impression of an object, since the perception of the 
two surfaces may be alternated at will. 


Il. THE GENERAL ALGORITHM 


The algorithm is an extension of the technique of random-dot 
stereograms.® ® The following simple example will give an insight into 
the workings of the general algorithm. The two surfaces, A and B, 
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to be portrayed by the stereogram are given in the z-z plane in Fig. 
2 and for simplicity are selected as cylindrical; (that is, 2 = f4(z) 
and z = fz(x), independent of y). Since stereoscopic vision operates 
on corresponding single rows in the two views, the algorithm given 
here applies to any surface (not only to cylindrical ones). 

In order to construct the stereogram we must specify the textures 
T,(x) and Tr(x) for the left and right images, respectively. The right 
image of the stereogram is selected as the perpendicular projection of 
Fig. 2, while the left image is viewed from an angle of 45 degrees. 

Examine for a moment the case where we have just one figure, z 
= f(x). We may pick the texture Tp(x) at random; the texture 7’, (x) 
is now basically chosen as follows: 


Tile + f(x)] = T p(x). (1) 


This just expresses the fact that a point x seen in the left image is 
displaced horizontally by a distance f(x) when viewed in the right 
image. 

There are two necessary qualifications to the above rule: 


(1) If « + f(x) and 2’ + f(z’) are equal for x unequal to 2’, in fact 


VIEW OF e 








VIEW OF 
LEFT 


eee 
~ SURFACE A 


REFERENCE PLANE 


JSURFACE B 
7 


Fig. 2— A simple example of two surfaces to be portrayed. (The cross section 
is indicated in the z-z plane.) 
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only the point corresponding to min (2, x’) is seen by the left eye, the 
other being “in the shadow.” The constraint (1) thus does not hold for 
the larger of x and 2’; we say the larger is obscured in this case. 
(ii) In the event that, after applying all the constraints, there are 
some values of 7',(x) not determined by Tr, these values may be 
chosen at random. 

Julesz and Miller developed these ideas extensively.® 

Now to color two figures, z = f4(x) and z = fp(x), we apply a 
similar method; the main difference is that there are more constraints, 
so that the right image can no longer be chosen at random. The images 
must represent both A and B; thus, if JT; and T represent the left 
and right textures as above we have basically 


Tile + fa(x)] = Tr(a) 
Tr[x + fa(x)] = Tela) 


for all x. Qualification 7 is still valid, and may serve to eliminate one 
or both of the above constraints for certain values of x. From this it 
is also seen that if x and 2’ are distinct, and neither @ nor 2’ is ob- 
scured, then 


fa(z) + & = fp(v’) + 2’ implies Tr(v) = Tr(a’) = Tile + fa(a)]. 
(3) 


(2) 


These are also easily seen to be the only constraints on Tp. 

Once again, qualification 7% is valid; anything not explicitly con- 
strained may be chosen at random. 

A simple example of the computational ease of this algorithm is 
given in Table I. Notice that x, f4 (2), and fz(x) are given in the first 
three rows. The rows labeled Ly and Ly, are formed by putting the 
integer x into positions x + fa(x) and « + fp(x), respectively, sub- 
ject to qualification 7. A * indicates a position that has been ‘“uncov- 
ered” by the shifting process—the texture here may be chosen at ran- 
dom if not otherwise constrained. 

We may read off our constraints directly from Table I; if La(a) ¥ 
L,(x), and neither L4(x) or Lz(x) is a *, then 

Tr[La(x)] = Te{Le(2)I. (4) 

After all such constraints have been applied to the right image, the 

left image can be generated by reference to equation (2) and qualifica- 


tion #2. Table I gives final values for Tp and T'; in terms of randomly 
chosen texture values ¢;, ts, ts, . 


TaBLeE I PrRoJEcTIONS* 


x 1 2 3 4 5 6 7 8 9 10 
fal) 0 0 1 1 2 2 3 3 4 4 
I p(x) 0 0 —l —1 —1 —2 —2 —2 —3 —3 
L(t) 1 2 * 3 4 * 5 6 * 7 
Lp(a) 1 2 4 5 7 8 10 11 13 14 
Tr (x) ty ts ta t3 ts tg t3 ts tg ts 
Tr (x) i to ts t4 ts ts ty tg ty t3 


* The left and right projections of the surfaces given in Fig. 2 before and after the constraints. 
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<———__ Fig. 3—Stereogram of unambiguous pyramidal staircase in front of the 
printed page with a 128 xX 128 picture element resolution. There are nine 
steps, altogether. Use the viewers fastened inside the back cover of this 
issue to see stereoptical effect. Place the red filter over your left eye. 


IV. CONCRETE EXAMPLES 


The demonstration will be quite general and the 1000 x 1000 dot 
resolution permits the portrayal of surfaces having complex shapes. 
The only restriction on the surfaces will be the use of cylindrical 
shapes. For this case, the algorithm determines the same constraints 
for each row but of course within the degrees of freedom each row is 
independently colored by a random process. For general surfaces 
each row would have to be computed separately, which would increase 
the computation time (now about two minutes on a GE 645 computer) 
nearly a thousand times. The cylindrical surfaces have another ad- 
vantage; they permit us to use two unambiguous surfaces at the top 
and bottom margins of the stereogram respectively, to facilitate per- 
ceptual reversals for the unexperienced observer. 

Besides the 1000 x 1000 dot resolution there are a few stereograms 
composed of 128 x 128 picture elements. In the 1000 x 1000 dot array 
each dot (picture element) can take three different brightness values; 
for the coarser array, each picture element can take eight brightness 
values. This is done by using three (eight) characters (blank, period 
sign, degree sign, asterisk, and so on) of the General Dynamics (Strom- 
berg Carlson) 4060 microfilm printer. For the 128 x 128 array the 
probability of each of the eight characters is equal (44). For the 1000 
x 1000 array the probability of using the light and heavy period signs 
is 0.05 while the probability for the blank is 0.9. Thus the average 
number of portrayed dots in these stereograms is 10°, which is within 
the resolution capabilities of the printing process. 

Figure 3 shows an unambiguous pyramidal staircase in front of the 
real plane of the printed page with a 128 x 128 picture element resolu- 
tion. In Fig. 4 the left and right images of Fig. 3 have been inter- 
changed and the pyramidal staircase is descending behind the printed 
plane. To view these illustrations use the anaglyphoscopes fastened 
inside the back cover of this issue. Put the red filter over your left eye. 


<———__ Fig. 4—Stereogram, identical to Fig. 3, except that the left and right 
images have been interchanged. The unambiguous pyramidal staircase is 
descending behind the page. 
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<——_ Fig. 5— Ambiguous stereogram, that contains both the pyramidal stair- 
cases in front of and behind the plane of the printed page as given in Figs. 
3 and 4. Both of these organizations can be obtained when sterosopically 
viewing Fig. 5, but only one at a time. 


If you have good stereopsis, the depth should become apparent to you 
within several seconds. 

Figure 5 demonstrates an ambiguous stereogram that contains both 
the pyramidal staircases above and below the printed plane as given in 
Figs. 3 and 4. Both of the organizations can be obtained when stereo- 
scopically viewing Fig. 5, but only one at a time. In a brief study 
Fig. 3 or Fig. 4 have been shown to 21 subjects who have never seen 
these stimuli before. After viewing one of these unambiguous stimuli 
for a minute the ambiguous stimulus of Fig. 5 has been shown. Ten 
subjects perceived that organization in the ambiguous stereogram 
which corresponded to the previous unambiguous organization. Nine 
subjects perceived the ambiguous stereogram always as the descending 
staircase, while two subjects always as the ascending staircase. There 
was no attempt on our part to train these subjects to learn to reverse 
the organization. On the other hand, the reader can learn easily the 
reversal if he alternates between Figs. 3 or 4 prior to viewing Fig. 5. 
Convergence movements of the eyes can influence the reversals, but 
it might require careful studies to determine whether reversal could be 
obtained while the eyes are immobilized. In Fig. 5, the maximum 
disparity is 8 picture elements. The degrees of freedom are 46 (out of 
128). 

The degrees of freedom can be greatly increased and the staircasing 
greatly reduced by increasing the resolution to 1000 xX 1000 dots. 
Such an ambiguous stereogram is shown in Fig. 6, portraying a single 
wedge behind the printed plane and two wedges in front of the printed 
plane. The maximum depth is +60 picture elements (dots), and to facili- 
tate perceptual reversal a 150 picture element wide margin in the upper 
and lower portions of the images contains the unambiguous surfaces 
A and B, respectively. A 50 picture element wide gap at zero depth 
level separates the unambiguous surfaces from the ambiguous organiza- 


<——-__ Fig. 6 — Ambiguous stereogram of 1000 x 1000 picture element resolu- 
tion with unambiguous margins in the upper and lower areas. Surface A 
is a wedge behind the plane of the printed page, while surface B is a double 
wedge in front of the page. Either one of the two surfaces can be perceived 
at will when stereosopically viewed, yet reversal can be aided by viewing 
the upper or lower margins, respectively. 
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<——_ Fig. 7— Ambiguous stereogram with unambiguous margins in the upper 
and lower areas. Surface A is a horizontal plane in front of the printed 
plane, while surface B is a wedge behind the printed plane. For viewing 
instructions see Fig. 6. 


tion. When looking at the upper portion of the fused stereogram the 
unambiguous ascending wedges usually carry with them the percept of 
the ambiguous wedges, while when looking down the ambiguous organi- 
zation reverses. Of course, these unambiguous margins serve only as an 
aid, and the ambiguous organization can be reversed at will when 
the margins are covered up. This stereogram is similar to Fig. 5, yet 
because of increased resolution the degrees of freedom are 359 (out 
of 1000 samples). This is adequate to portray images without excessive 
formation of perceivable periodic stripes. 

Particularly interesting is Fig. 7, which has 1000 x 1000 resolution. 
Here the upper margin contains the unambiguous surface A which 
is a plane with 40 picture element disparity, while the lower margin 
contains the unambiguous surface B which portrays a descending 
wedge having a maximum disparity of -60 picture elements. In this 
example there is no gap between the ambiguous and unambiguous 
surfaces. Organization A is very strong and our everyday experience 
would suggest that when each dot of a front plane is seen by both 
eyes without any hidden areas present, then this plane should be the 
only percept. Yet, as Fig. 7 demonstrates, it is relatively easy to ob- 
tain the other organization too. Here the degrees of freedom are only 
99 (out of 1000), yet in spite of this low degree of freedom the image 
quality is very good. 

When we try to portray more than two surfaces, the degrees of 
freedom rapidly diminish. On the other hand, some of the stereograms 
with two surfaces yield some additional percepts. For instance, in 
Fig. 7 after the front plane is perceived, sometimes the percept of an 
ascending wedge above the plane can be obtained too. 

Figure 8 shows a stereogram that can be perceived in many different 
ways as illustrated in Fig. 9. The unambiguous margins correspond 
to the two shapes in Fig. 9a, but the reader might obtain the other 
percepts as well. Obviously the strongest constraints are obtained for 


<———__ Fig. 8— Ambiguous stereogram with unambiguous margins in the upper 
and lower areas. Two slanted planes (above the plane of the printed page) 
that interest each other are portrayed. Fig. 9 shows the various percepts 
which can be obtained. 
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Fig. 9—Schematic illustration of the various ways Fig. 8 can be perceived. 
The cross sections in the x-z plane are indicated. 


parallel surfaces A and B with a few dots separation in depth. This 
occurs near the intersection of the two surfaces, yielding visible 
clusters of dots having the same brightness values. 

Figure 10 portrays a cosine function and a cosine function of les- 
ser amplitude and half periodicity. Since both surfaces are in front 
of the surround and close together, an interesting perceptual phenom- 
enon can be experienced. For the previous demonstrations only one 
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Fig. 10— Ambiguous stereogram with unambiguous margins in the upper and 
lower areas. It portrays a cosine function and a cosine function of lesser ampli- 
tude (height) and half periodicity. Both surfaces appear in front of the printed 
plane. 
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organization could be perceived at a time and considerable amounts 
of convergence movements had to be initiated in order to get rid of 
the prevailing organization and to bias the other organization. In Fig. 
10 it is possible to retain one organization and meanwhile start to 
perceive the other organization. The perceptual effect is that of a 
transparent surface behind which another transparent surface is seen. 
However, this double perception is not a stable state and it is easier 
to perceive only one organization at a time. The degrees of freedom 
are 119 (out of 1000). 

Finally Fig. 11 shows a case in which three surfaces are portrayed. 
Besides a cosine function in front and behind the surround there is 
a plane with zero disparity. Interestingly enough for this special case 
the degrees of freeedom are not additionally reduced. As long as the 
surfaces A and B are each other’s mirror images and the third sur- 
face is a plane with zero disparity, it is possible to portray three sur- 
faces without additional constraints. The degrees of freedom are 78 
(out of 1000). An unambiguous rectangle at the top and bottom mar- 
gin is presented at +70 picture element disparities in order to aid 
perceptual reversal. 


V. CONCLUSIONS 


From these demonstrations it is possible to see some of the limita- 
tions of the ambiguous random-dot stereograms. In order to avoid 
short periodicities the surfaces to be portrayed should not intersect 
or come closer than a few picture elements in the z direction. The 
worst case is if the two surfaces are one picture element apart. In 
this case the periodicity is one and the two surfaces are formed of 
horizontal lines having the same color. As we have discussed the two 
surfaces can coincide but after separation they have to separate in a 
discontinuous fashion having a jump in depth of several picture ele- 
ments. In Fig. 12 special care has been taken to separate the two sur- 
faces in depth. Therefore a cosine function (to be seen in front of the 
plane of the printed page) has been placed on a 10 picture element 
high pedestal. The other surface is a wedge (behind the printed plane). 
The degrees of freedom are 128 (out of 1000). Because of this pedestal 
the shortest periodicity is limited to ten and the resulting stereogram 
can be easily fused and reversed. 

Another limitation is the rapid decrease in the degrees of freedom 
as the number of surfaces is three or more. By increasing the resolu- 
tion of the images the absolute degree of freedom increases as well, 
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<————_ Fig. 11 — Ambiguous stereogram with two unambiguous rectangles in the 
upper and lower areas. It portrays three surfaces. A cosine function in front 
of the plane of the printed page, the same function behind the plane, and 
the printed plane, itsclf (a plane with zero disparity). 


so that three or more surfaces could be adequately portrayed. Un- 
fortunately, the resolving acuities of the eyes limit the size of indi- 
vidual picture elements to about 1 minute of arc. With finer image 
resolution more than one picture element (of different brightness 
levels) falls on a single receptor of the retinas and the image contrast 
rapidly decreases. 

A third limitation of ambiguous stereograms is the unavoidable fact 
that because of the constraints there will be some other dense surfaces 
perceivable besides the selected two surfaces (as pointed out in the 
demonstration). Since most of these phantom surfaces are perceived 
at greater depth than the desired ones, there is a way to eliminate 
them. If the desired surfaces span the depth limits for fusion, then 
the phantom surfaces will be outside the region of maximum dis- 
parity for stereoscopic fusion. 

As long as the surfaces to be portrayed are placed such that they 
stay separate in distance and the number of surfaces is two, the 
above technique gives satisfactory results. The obtained results are 
analogous to holography, but only superficially. After all, holograms 
contain a vast number of stereoscopic views, while ambiguous stereo- 
grams contain only a single one. Holography is based on the diffrac- 
tion properties of coherent wave optics, while our technique uses 
plain geometrical optics. For holography the observer has to move 
around the hologram in order to inspect it from various angles, while 
for ambiguous stereograms the viewer can stand still. It is his mind 
that wanders around the object. Furthermore, ambiguous stereograms 
can portray any mathematical surface, including completely hidden 
ones, from any view, this could be obtained by computer-aided or 
computer-generated holograms as well, but would require more effort. 

The advantages of holography or lenticular screen panoramagrams 
could be combined with ambiguous stereoscopy. It might be possible 
to generate stereograms by computer and place them behind a lentic- 


<————__ Fig. 12— Ambiguous stereogram with wide unambiguous margins. It 
portrays a cosine function in front of the printed plane and a wedge behind 
the plane. 
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ular screen such that each separate stereoscopic view is constrained 
by our algorithm. Since each view is independent from each other, 
no further reduction of the obtainable degrees of freedom will result. 
Unfortunately, the generation of ambiguous stereograms for the most 
general surfaces is at the limit of present computer economies and to 
compute hundreds of them for a single portrayal is certainly imprac- 
tical. Yet, with next generation computers these and similar repre- 
sentations can be tried. 

The emphasis of this article has been on the reporting of a new 
tool for pictorial representations and the obvious psychological im- 
plications have been only briefly mentioned. We can now study, for 
example, whether convergence motions of the eyes are necessary to 
destroy an existing perceptual organization in order to reverse to 
the other one. Furthermore, each of the ambiguous organizations 
can be biased by a few randomly introduced unambiguous picture 
elements in order to counteract natural bias. (This biasing technique 
has been successfully tried for periodic random-dot patterns in a 
study of perception time.” °) 

A pilot study was reported above which showed that if one of 
the organizations has been first presented as an unambiguous stereo- 
gram, then the perception of the ambiguous stereogram could be 
influenced accordingly. It remains to be seen whether prior auditory 
or tactile information would similarly influence perception. 

Originally, random-dot stereograms were conceived to remove from 
the monocular images all the familiarity cues and Gestalt factors 
that influence perception in uncontrollable ways. It is therefore 
somewhat unexpected that this further development of the technique 
in the form of ambiguous random-dot stereograms seem to provide 
a powerful tool for the study of Gestalt factors. In this instance the 
shapes are the configurations of the surfaces in depth. Questions 
exemplified in Figs. 8 and 9 can be studied, such as whether good 
Gestalt or good continuation outweigh the reduction of disparity. 
Reversible figures have been frequently used in perceptual psychology 
such as Necker cubes (two-dimensional outline drawings of a cube), 
ambiguous staircases, and so on. However, these stimuli have been 
selected from a small repertoire and exploited certain ambiguities 
inherent in two-dimensional drawings. That ambiguities can be pro- 
duced in three-dimensions without practical limitations on the or- 
ganizations to be portrayed is a result which seems far from trivial. 
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The Capacity of Multiple Beam 


Waveguides and Optical 
Delay Lines 


By D. GLOGE and D. WEINER 
(Manuscript received July 10, 1968) 


The capacity of a beam waveguide can be increased by transmitting a 
multitude of Gaussian beams in such a way that they are clearly resolved at 
the receiving end. Various systems with maximum capacity but different 
crosstalk sensitivity are discussed. Linking the available channels end to 
end in an optical cavity produces a delay line or storage device. An optimized 
system ts described which has surprisingly large storage capacity. For the 
analysis of both lens guides and optical cavities, a phase space representation 
of Gaussian beams is used which avoids cumbersome mathematics. 


I. INTRODUCTION 


A Fabry-Perot interferometer with curved mirrors can be used as 
an optical delay line by inserting a laser beam through a small center 
hole in one mirror.t The beam performs many off-axis round trips 
before leaving the interferometer through the entrance hole.? Reference 
1 suggests that the injection and retrieval of the beam could be im- 
proved by mismatching beam and cavity. A systematic study is carried 
out here to find the longest folded path that starts and ends in the cen- 
ter hole, thus optimizing the system for maximum storage capacity. 

Very similar to this problem is the analysis of a periodic lens guide 
in which many beams are to be transmitted in such a way that they 
are clearly resolvable at the receiver end. One such system is a 
transmission link that forms an image array of modulators in the 
receiver plane. The possible density of channels is given by the number 
of resolvable spots in this plane.® 

The investigation of all possible Gaussian beams transmitted si- 
multaneously in a guide will show that this is only one among many 
possible systems. All these systems exhibit the maximum theoretical 
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capacity as given by the classical limit,‘ but may be affected differ- 
ently by guide imperfections. 

The first part of this study will outline a simple geometrical method 
of describing gaussian beams avoiding the cumbersome mathematics 
connected with gaussian beam optics.’ Based on this method, it will 
be easy to find the optimum storage cavity and to investigate various 
multiple beam transmission systems. 


II. PHASE PLANE AND PHASE SPACE 


In continuous or periodic guiding media, the “phase space” repre- 
sentation of paraxial rays is very convenient. Consider, for example, 
the two-dimensional continuous lens-like medium in Fig. la in which 
the index of refraction is a function of the transverse coordinate only: 


nx) = na( 1 — iz). (1) 


Call A the “focusing parameter.” The paraxial ray solutions are sine 
waves with the period 


P = 27A (2) 


as shown in Fig. 1b.° Figure 1c shows a “phase plane” in which every 
ray of Fig. 1b is represented by a point. The coordinates of the points 
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Fig. 1— Rays in a homogeneous guiding medium. (a) The square-law index 
profile. (b) Rays oscillating with various phases. (c) The corresponding points 
in the phase plane. 
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correspond to the position x and the slope wv’ of the ray multiplied by 
A. As the rays proceed in the square-law medium, the points orbit 
around the origin of the phase plane while their position with respect 
to one another stays the same. 

Steier? has shown that for every gaussian light beam one can find 
a packet of rays equivalent to this beam in the sense that the packet 
envelope gives the beam width and the average ray slope is perpen- 
dicular to the beam phase front. The ray packet may be represented 
by an array of points in the phase plane. Consider, for example, a 
fundamental gaussian beam propagating in the square-law medium of 
Fig. 1. The 1/e-half width of such a beam is 


(2) ; 


where A is the optical wavelength. The equivalent ray packet is 
basically the one shown in Fig. 1b with ray amplitudes w. The cor- 
responding points in the phase plane occupy a circle with radius w 
similar to the presentation in Fig. le. 

Following these arguments, any gaussian beam—varying in posi- 
tion, slope, or width along the guide—may be represented by its array 
of points in the phase plane. The points form a “phase spot” in the 
phase plane whose shape and position determine the beam parameters. 
Once the phase spot is known at one point along the guide, it can be 
found for any other point by simply rotating the phase plane. The 
correspondence rules between the phase spot and the beam parameters 
follow from Steier’s ray racket equivalence and are explained in the 
following examples. 

Figure 2 shows a gaussian beam of width w entering the guide with 
a slope a. The beam phase front is tilted by a and consequently the 
average slope of all rays in the ray packet must be a. This condition 
is satisfied by a circular phase spot displaced horizontally by aA. As 
the beam proceeds in the guide, the phase spot orbits around the 
origin of the phase plane. Projection of the phase spot on the vertical 
axis yields the beam width and position. The horizontal displace- 
ment determines the slope. Notice that Fig. 2 and the following figures 
are two-dimensional beam representations. The phase spots should 
not be confused with a cross-sectional view of the beam. 


Figure 3 shows a beam that enters the guide with a phase front curved 
with a radius R. Consequently, the average slopes of the equivalent 
rays vary linearly across the ray packet. In the phase plane horizontal 
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Fig. 2— The phase spot for a beam entering at an angle a. 


slices of the phase spot are displaced horizontally by zA/R according 
to their position zx which distorts the circular phase spot to an ellipse. 
Notice that the area of the phase spot is not changed by this process. 
From equation (2) one finds that this area is rw? = Ad. It is the same 
for any gaussian beam of a given wavelength in a given guiding medium. 
A beam, for example, that enters with a plane phase front and a half 
width u ~ w has an elliptic phase spot with the principal axes u 
and 


v = Ad/ru. (4) 


If the guiding medium is not homogeneous along the z axis but 
a periodic sequence of lenses, the phase plane method is still valuable 
though, with the same convenience, the beam can only be described 
in the planes of the lenses and not in the sections between. This, 
however, is in general sufficient because, no matter what the features 
of the gaussian beam, it will always be largest at the lenses and 
therefore it will be this width that determines the aperture of the 
whole system. 

For a periodic sequence of lenses with focal length f, spaced at a 
distance d, the convergence parameter is® 





A = d/sin ® (5) 
xf 
i 
i 
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Fig. 3— The phase spot for a beam entering with a curved phase front. 
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with 


cos ® = 1 — d/2f (6) 


and the equivalent ray period is 27 d/®. 

For thin biconvex lenses, it is the symmetry planes of the lenses 
where the Gaussian beam can be defined most conveniently. Beam 
width and phase front curvature in this plane determine the equiv- 
alent phase spot. Counterclockwise rotation of the phase spot by 
an angle & corresponds to passage from one lens to the next. 

The phase plane method may also be extended to nonperiodic 
structures. It is restricted, however, to the paraxial approximation, 
to square-law guiding profiles (including uniform dielectrics), and 
to coherent beams with Gaussian intensity profile and spherical phase 
fronts. 

Notice that the phase plane considers only deflection and displace- 
ment in x direction and that a similar definition exists for the y 
coordinate. The two phase planes combined yield the four-dimensional 
phase space, and the phase spot becomes a four-dimensional] structure. 


III. SPATIALLY INDEPENDENT CHANNELS 


The capacity of a beam waveguide can be increased by transmitting 
several gaussian beams separated spatially. The tolerable crosstalk 
determines the separation of the individual beams. For convenience, 
let us describe around every beam a fictitious tube, k times wider than 
the 1/e width, where k is chosen so that the crosstalk requirement is 
met when these tubes just touch. In practice, the main source of cross- 
talk will be beam distortion and scattering rather than the spread of 
the ideal beam. The factor k, therefore, will vary from guide to guide 
according to the tolerances of the guiding components. 

Figure 4 shows a two-dimensional square-law medium of width 2a, 
and the corresponding phase plane. In order for the beams to clear 
the guide walls, the phase spots must stay within the circle r = a, 
while orbiting in the phase plane. Considering that the phase spots 
require an area k? Ad to fulfill the crosstalk conditions, it is easy to 
find the phase spots that make the best use of the available guide 
(see Fig. 4). The phase spots determine the beam parameters. 

As the beams oscillate in the guide, they overlap in certain areas. 
There are, however, cross sections spaced by distances zA at which 
all beams are separated. One of these cross sections may be chosen as 
the receiver plane. 
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Fig. 4—A possible distribution of phase spots in the useful phase area. 


If the guiding medium is not homogeneous along the z axis, but 
consists of a periodic sequence of lenses of width 2a,, the useful phase 
area may be different from that in Fig. 4. In this case, discrete aper- 
tures have to be considered at the positions of the lenses. (The inter- 
mediate guide diameter in a lens guide in general is immaterial, be- 
cause between the lenses the beams have a smaller cross section and 
separation than at the lenses). Figure 5 shows the useful phase area 
and the spot pattern for confocally arranged lenses. 

This case surmises that, proceeding from lens to lens, the rotation 
of the phase pattern is exactly 90°. Even if the tolerances for the focal 
lengths and lens spacings are very strict, these rotations will even- 
tually, after many lenses, get out of step with respect to the lens posi- 
tions and aperturing will occur when the phase pattern is rotated at 
any angle in the phase plane. This situation is shown in Fig. 6. If 
this happens, the useful phase plane is restricted to a circular area. 
It seems, therefore, that Fig. 4 represents a more general case for 
practical applications. 
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Fig. 5 — The useful phase area for a confocal imaging system. 
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Fig. 6— The phase pattern is out of step with the confocal lens position. 


The number of phase spots that can be fitted into the circular area 
ma. of Fig. 4 is approximately 


Ts 1 Os (7) 


~ Ak’uv 4k? Ad? 








n 


assuming that the area occupied by one spot may be approximated by 
the dotted rectangle in Fig. 4. For large numbers of beams (n = 10), 
this approximation is satisfactory. 

There is no reason why the beams have to be arranged the way they 
are in Fig. 4. There is no restriction on shape and location of the phase 
spots in the useful phase plane. Of course, arranged as in Fig. 4, the 
beams are clearly separated at distinct cross sections, which makes 
launching and receiving a simple and straightforward matter. 

EK. A. J. Marcatili of Bell Telephone Laboratories suggested the 
arrangement shown in Fig. 7 and demonstrated how such beams may 
be launched: a common lens is used for every overlapping group of 
beams feeding every member of the group at a different angle. At a 
distance zA/2 from this lens, or at multiples of this distance, the 
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Fig. 7—A distribution of the phase spots that minimizes distortion. 
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beams arrange themselves in groups again and could be separated by 
means similar to the transmitter scheme. 

Whereas the beams in Fig. 4 vary considerably in width as they 
propagate along the guide, the beams in Fig. 7 constantly keep the 
minimum width w as given by (3). From the nature of the distortions 
on optical surfaces, it might be expected that large beams will be dis- 
torted more than small ones. In this case, the arrangement in Fig. 7 
might be less susceptive to mutual interference of beams and con- 
sequently permit a smaller k factor and a larger capacity. For equal 
k there is no difference in capacity of both schemes, at least not within 
the accuracy of (7) which was derived for large n. Other arrangements 
as well as combinations of the schemes in Figs. 4 and 7 may be prac- 
tical for certain cases. 

By applying the phase space technique to three-dimensional sys- 
tems, some of the lucidity is lost, but one can still gain some interest- 
ing results. If the beam waveguide has a cylindrical cross section of 
radius A, a circular area with the radius 


G, == (A? = af) (8) 


in the y-phase plane is available simultaneously with the area 7a?. The 
total useful phase space is consequently 


wA? 
s= [wal dra’). Q) 
0 
By inserting (8) into (9), one has 


S = At. (10) 


Allowing rectangular areas for the phase spots in both the zx- and 
y-phase plane, as in the two-dimensional example, the total capacity 
is found to be approximately 


gs 1744 x At 


(4k? A\/m)* 32 KFA? ? (11) 


N= 
If the total number of beams is large (n 2 100), the rectangular approxi- 
mation for the area occupied by the phase spots is satisfactory and (11) 
holds independent of the way in which the beams are arranged in the 
guide. Figure 8 shows a nomogram based on (11) for an optical wave- 
length of 1 micron. Given the radius, lens spacing, and filling factor k of 
a lens guide, one can easily find the possible capacity. Consider, for 
example, lenses spaced confocally by 100 m. If their useful optical area 
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has a radius of 10 cm, and the filling factor is k = 3, approximately 
300 beams could be transmitted in parallel. 

Comparing (11) with M. von Laue’s formula for the spatial degrees 
of freeedom of an optical system,‘ one finds that N approaches the 
classical limit for k about 1, that is, the beams would have to overlap 
at their 1/e amplitudes in order for the capacity of the guide to be 
fully used. There are many reasons why this limit cannot be reached 
in practice. Particularly important are the imperfections in the guide 
itself, 


IV. BEAMS IN CAVITIES 


The rules of gaussian beam geometry can also be used for optical 
cavities. Considering the cavity as a folded beam waveguide, possible 
beam paths can be traced using the phase plane. This way the useful 
capacity can be found for delay or storage applications. 

Figure 9a shows a 2-dimensional square-law medium. The two 
plane surfaces M, and Mz, are highly reflecting mirrors and form an 
optical cavity. A beam, launched off-axis through the center hole of 
mirror M,, would perform several round trips between the mirrors 
before hitting the entrance hole and leaving the cavity. Figure 9a un- 
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Fig. 8— Nomogram evaluating the guide capacity for \ = 1 micron. 
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Fig. 9— The beam path in a storage cavity consisting of a focusing medium 
between the mirrors M, and Me. (a) The beam path unrolled along the axis. 
(b) The phase spots at the mirror M:. 


rolls the beam path along the guide axis. Figure 9b shows the phase 
spots for the cross sections 1 to 5 which correspond to reflections of 
the mirror M,. In agreement with the previous considerations, the 
spots correspond to tubes k times wider than the 1/e-width of the 
beam. It is assumed that interference between different round trips 
and distortion is tolerable if the phase spots do not intersect one an- 
other, the boundary of the useful phase area, and the area occupied 
by the center hole (area between the broken lines in Fig. 9b). 

Obviously, the total number of round trips can be increased by 
decreasing the angle 6 shown in Fig. 9b. 6 is smallest when the phase 
spots just touch the broken lines. Also there should be an optimum 
shape of the phase spots for which @ is a minimum. Though the area 
k? Ad of a phase spot is fixed, the main axes u and v can be chosen. 
Particularly if the cavity radius a, is large and a large number of 
round trips is to be stored in the cavity, the best ellipses will be long 
and thin, and the center hole diameter 2kv will be small. 

It is now a simple matter of geometry to calculate the exact param- 
eters. From the requirement that spot 1 touch the broken line, one 
finds 


2kv(a, — ku) 
a? — 2a,ku + kv? 
As indicated above, v will be much smaller than a, and u for optimum 
systems with large capacity. By neglecting v? in the denominator and 
replacing v by (4) in the numerator, one has 


sin 9 = (12) 


2A\.k a, — ku 


Ta, a, — ku (13) 


sin 62 ——— 
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The derivative d sin 6/du vanishes for the optimum value 


thon = & E - as | (14) 


and it turns out that 
» Ak 
= Ga, 1 — 1/8) 


is indeed small for large a,. Under these conditions @ will also be small, 
and by replacing sin @ by 6 one finds 


~ Ark? sa 
tone BEL AT ao 


2 
TOs 





(15) 





The maximum number of round trips is 


a TS ra. | 1 ii 
deny ame .0 a _* (2)3 | ° (17) 


For n = 10, this formula gives satisfactory results. 








The proper length of the cavity in Figure 8a is 
BA = 3(7 — Omin) A (18) 


with Onin from (16). If, instead of a homogeneously focusing medium, 
concave mirrors are used, (5) and (6) determine the mirror spacing d 
and the focal length f. In connection with (17) and (18) one has 


d = A cos BB A (19) 
and 
d : Citi cay x. 
BS pp On Ep = on (20) 


Notice that for large n the mirrors are almost confocally spaced. 
Knowing the solution in the x plane, one would like to solve the 
three-dimensional problem by just doing the same in the y plane. 
Figure 10 shows equivalent phase planes for the z- and y-axes of the 
end mirror. Projection into the mirror plane yields the actual beam 
cross sections. Though it will be shown later that this arrangement is 
not quite optimum, Fig. 10 is very useful to calculate the cavity radius 
A necessary to accommodate this or, later on, an improved beam path. 
The maximum displacement a,/(2)* = a,/(2)*% occurs simultane- 
ously in the z and y directions. The total displacement of the beam 
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Fig. 10— Construction of the beam cross sections at the mirror surface to 
determine the radius A of a cylindrical cavity. 


axis is therefore a, = a,. The maximum displacement coincides with 
the largest beam cross section whose radius is Upp: as given in (14). 
Consequently, a radius 


A ng. Ag a2 - oul (21) 


is necessary for a cylindrical cavity. 

The capacity can be increased drastically by deliberately introduc- 
ing astigmatism as described in Ref. 1. Suppose the beam behaves 
in the x plane as shown in Fig. 9 but, simultaneously, oscillates in the 
y plane in such a way that it returns to the center of the y plane al- 
ready after 4 round trips. It can only leave the cavity when it is dis- 
‘placed neither in the z- nor the y-direction, and that happens for the 
first time after 20 round trips. Generally speaking, one achieves 


N = 2n(n — 1) (22) 


transits by this method. Technically, this can be done by warping one 
or the other of the mirrors slightly. Writing (20) for both x and y 
plane and subtracting one from the other yields, by using (22), 
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Dn US wig (23) 


For large n the approximation N ~ 2n? can be made and by using 

(17) and (21) one has 
~, 20 A* 1 

VE ENE B+ OT 
This is about a factor of 6 less than the number given in (11), that is, 
the described method does not fill the total available capacity. Notice 
that, in Fig. 9b, there is room for exactly one more set of spots be- 
tween the used spots. This space would be filled by a beam that fol- 
lowed the same path as the described one, but in opposite direction. 
The additional capacity could be exploited by reflecting the existing 
beam back on itself. It can be shown that, in the three-dimensional 
scheme, there is space for an additional totally independent path in 
the cavity. Both paths could be linked by an outside mirror. By reflect- 
ing the two linked beams back into itself, the number of round trips 
could be quadrupled. 

Without considering these sophistications, let us investigate what 
(24) means in terms of storage capacity. For large N (19) can be 
used to calculate the total length of the beam path which, with the 
numerical factors evaluated, is 


A‘* 

~ 2dNKS 
Surprisingly enough, this path is longest for a small cavity length d. 
Of course, the number of bounces (and consequently the losses) in- 
crease in a short cavity. The best mirrors available introduce a reflec- 
tion loss of 0.05 percent or 43.5 dB attenuation after 20,000 bounces.® 
This corresponds to 10us delay in a 15 cm cavity with mirrors 4 cm 
in diameter. If part of the loss is compensated by an amplifying ma- 
terial in the cavity, the number of round trips is eventually limited by 
scattering in the system.® 

Perpetual recirculation of PCM information could be achieved by 
using an arrangement that amplifies 27-pulses*® or a fast saturating 
absorber in combination with a suitable laser amplifier.7t In both 
cases only pulses of a certain length and intensity are amplified, while 
any other signal is attenuated. It would be sufficient to provide the 
amplication at a few particular parts of the folded path where it is 
spatially separated from other round trips. If enough amplification 


1 = Nd (25) 
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of this kind is provided to make up for all losses, pulses of the proper 
kind could circulate in the cavity perpetually without noise building 
up. 

If the mirrors are moved so close to one another that they touch at 
the circumferences, the paraxial approximation, basis for the previous 
calculations, loses its validity. By extrapolating (25) into this range, 
however, one finds some interesting, though speculative, results. The 
center of the confocal mirrors are now spaced by d = 2A/(3)*%. For 
a bandwidth b small compared to the light frequency v, the capacity is 


3 
piel oe (26) 


with J from (25). 
It is easy to calculate the volume V of this nut-shaped cavity. It is 


Verna’. (27) 


The number of the degrees of freedom of a cavity whose dimensions 
are large compared to A is independent of its shape and has the value* 


8bV 
Cth = ou * (28) 





In other words, the maximum number of bits which V can hold in the 
form of electromagnetic energy is ci. By using (26), (27), and (28), 
one finds the (extrapolated) efficiency of the beamfolding method to be 


1 . 
(3.3k)* 
For k = 38, this efficiency is only 10-*, but even then a capacity of 16 
k bit seems achievable with 1 GHz bandwidth in a cavity with the 
radius A = 1 cm. 





= (29) 


Cc 
Cth 


V. CONCLUSIONS 


Various methods can be used to transmit a multitude of beams 
through a lens guide in such a way that all beams are clearly resolv- 
able at the receiving end. The number of beams which can be trans- 
mitted is proportional to the square of the guide cross section and 
may be of the order of 300 for a guide of 10 cm radius with lenses 


“* See, for example, Ref. 4. 
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‘spaced confocally by 100 m. In this case, the centers of adjacent 
beams would be spaced by 6 beamwidths at particular cross sections 
in the guide. 

Linking the available channels end to end in a cavity produces a 
delay line or storage device. At 1 micron wavelength a 10 psec delay 
can be achieved in an optimized cavity, 15 cm long and 4 cm in diam- 
eter. The storage capacity is inversely proportional to the cavity 
length. Hence, the ultimate configuration would consist of two con- 
focal mirrors with their circumferences touching. Extrapolating the 
paraxial theory to this situation yields a capacity of 16kbit for a 
bandwidth of 1 GHz if the nut-shaped cavity has a radius of only 
1 cm. 
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A Model of a Domestic Satellite 


Communication System 


By LEROY C. TILLOTSON 
(Manuscript received July 12, 1968) 


A preliminary study of a domestic satellite system is reported. Since 
the objective was to determine what might ultimately be possible, no attempt 
1s made to relate system capacity to estimated needs; rather an effort has 
been made to conceive a system to carry the greatest possible amount of 
traffic. By making full use of modern rocket technology including the 
Saturn V class propulsion systems, highly directive multibeam antennas 
operating in the range from 16 to 40 GHz, interference resistant modulation 
methods, highly stabilized synchronous repeater platforms, and integrated 
solid state microwave repeater electronics, a very large communication 
capacity 1s obtained. For example, using 50 ground stations and 50 satel- 
lites operating in bands at 20 and 80 GHz, each 4 GHz wide, a total of 
100 million voice circuits, or equivalent, can be provided. 


I. INTRODUCTION 


Domestic satellite systems can be expected to handle a large amount 
of trafic compared with that carried by transoceanic systems; this is 
true even if for various reasons only a fraction of the total domestic 
traffic goes via satellite. For this reason among others, the presently 
allocated frequency bands at 4 and 6 GHz are not well suited to 
domestic use. Frequencies above 10 GHz are attractive in that they 
are not as heavily loaded as the lower frequency bands. But they are 
subject to propagation difficulties which generate new problems in 
their use.* The amount of atmosphere traversed by a ground-to-satel- 
lite path can be relatively small if the look angle is restricted to eleva- 
tions which are not too small. However, attenuation will be large un- 
der conditions of excessive rainfall and diverse ground terminals will 


* Similar arguments apply to terrestrial systems operating above 10 GHz. These 
are not discussed here, but some of the propagation studies described here were 
designed with the needs of terrestrial as well as satellite systems in mind. 
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be needed when common carrier grade continuity of service is required. 

The frequency range from 10 to 40 GHz can offer a unique oppor- 
tunity if broad continuous bands are allocated for satellite service. 
Modern rocket technology gives us the opportunity of placing large 
multichannel satellite repeaters in synchronous (24 hour) equatorial 
or inclined orbits, thus making it possible to exploit broad communica- 
tion bands and to use orbit space very efficiently. This would make 
possible best use of orbit space, frequency space, and the investment 
in facilities since very large amounts of traffic could be carried by 
each satellite repeater. The availability of such frequency alloca- 
tions and adequate orbit space is assumed in what follows. A basic 
assumption made here is that frequency space and orbit space are 
precious and limited resources which must be conserved. 

Radio frequencies above 10 GHz have disadvantages in propaga- 
tion, but their short wavelength makes possible very narrow beams 
from antennas of a size suitable for use on a contemporary satellite. 
If we combine this feature with interference resistant modulation 
techniques, such as PCM and multiple feed antennas, we can construct 
multichannel, multibeam satellites which can communicate simul- 
taneously with many ground stations using only one frequency as- 
signment. The idea can also be used in reverse to enable a single 
ground station to communicate simultaneously with several satellites. 
Thus if we have N channels per frequency assignment, S satellites, G 
ground stations, and every satellite “sees” every ground station, and 
vice versa, we have a total communication capacity of C=GxSxN 
channels. With tens of ground stations and tens of satellites working 
in bands a few GHz wide, this can results in a truly prodigious ca- 
pacity. 

In order to take full advantage of these possibilities, very reliable, 
efficient, and small radio repeaters for satellites will be required. Solid- 
state integrated microwave circuits and devices will help insure relia- 
bility and small size, and we have reason to hope that eventually ef- 
ficient use of de power can also be obtained. This is important since 
a significant part of the total in-orbit satellite cost results from the 
solar power supply. 


II. SYSTEM CONCEPT 


Designers of communications systems normally have as their ob- 
jective a facility to meet a fairly well defined need and naturally try 
to choose a system configuration which is most economical in serving 
that need. Thus, one of the most important parameters influencing 
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system design is the present traffic level and the expected growth rate. 
In addition to the amount, the geographical distribution of traffic is 
important. If a large part of the traffic terminates at one node, the 
usable system capacity will be reached when all of the channels from 
the satellites to that node are full. If the traffic volume continues to 
increase but the geographical distribution remains unchanged, growth 
must be handled by other (terrestrial) means. 

While much of the present-day telephone traffic is concentrated in 
large metropolitan centers, an assumption that this trend will con- 
tinue into the indefinite future may not be warranted. There are many 
signs that some of these areas have reached a ‘critical mass” and are 
beginning to explode; future growth may be spread much more uni- 
formly over the United States. At any rate, we have assumed uniform 
traffic density when calculating total system capacity; to the extent 
that this is untrue, part of the system potential will be unrealizable. 
This approach has the disadvantage of making the study somewhat 
more abstract, but our goal is to display the potential of a satellite 
system designed for domestic communication rather than to design a 
specific system. 

To this end we postulate 50 ground stations distributed more or less 
uniformly over the continental United States working with 50 satel- 
lites in synchronous orbit stationed due south of the United States 
and spaced 1° apart. Each satellite is precisely stabilized in attitude, 
carries a 10 meter multibeam antenna operable at 20 and 30 GHz, 
transmits down with a power output of 2 watts at 20 GHz and receives 
from the ground at 30 GHz. We further assume that two bands each 
4. GHz wide and centered at 20 and 30 GHz are assigned to this 
service. Given these frequency bands, we can design for eight 630 
megabit per second two-way channels using four-phase angle modula- 
tion. To serve 50 ground stations each satellite will require (eight 
RF channels per beam) X (50 beams) = 400 repeaters. At 10,000 
voice circuits per RF channel (630 megabits per second capacity) this 
results in 4 million (one-way) voice circuits per satellite. In terms of 
present day telephone traffic, this is a very large cross-section, but for 
broadband services which require 100 (Picturephone® visual tele- 
phone) to 1,000 (television) times more bandwidth, it is not so large. 

If the ground stations are equipped with 10 x 17 meter multibeam 
antennas, 10 watt 30 GHz transmitters and a cooled parametric re- 
ceiver preamplifier operating at 20 GHz with an equivalent noise 
temperature of 150°K, one can calculate carrier-to-noise ratios with 
the result shown in Table I. Several factors are worthy of note. The 
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ground-to-satellite link is assumed to suffer interference —39 dB rela- 
tive to the desired carrier; the down link interference is assumed —33 
dB relative to the carrier. In both cases the most important interfer- 
ing signals are admitted by the sidelobe response of the multibeam 
satellite antenna. Interference at the satellite caused by minor lobes 
of the ground station antennas can be controlled by spacing of the 
satellites, and in fact, if the satellites are 1° apart as assumed and 
the 10 x 17 meter 30 GHz ground station antennas have a response 


Taste I— C/N on Heavy RovtsEs 


System Parameters 


d = 23,000 miles = 3.7 X 107 m. 
f = 30 GHz (up link) 


f = 20 GHz (down link) 

Width of RF assignments (2) 4000 MHz 
Two-way RF channels per beam 8 
Pulse rate 315 X 108/s 
Bit rate (4 ¢ angle mod.) 630 X 106/s 

per satellite 8(G) 
RF channels per ground station 8(S) 

in system 8(G)(S) 
Picturephone® circuits per RF channel 100. 
Voice circuits per RF channel 10,000. 


Rocket class: Saturn V 


Satellite 
Antenna 
Diameter 10m 
Diameter illuminated by each feed 8.3m 
Allowance for illumination taper (—2 dB) 0.63 
Effective area 34.0 m? 
Transponder power output +3 dBw 
Receiver noise figure 6 dB 
Receiver noise bandwidth 400 MHz 
Ground Station 
Antenna 
Dimensions 10 X 17m 
Sector illuminated by each feed 10 X 10m 
Allowance for illumination taper 0.80 
Other problems (—1.5 dB) 0.70 
Effective area 56.0 m? 
Transmitter power output +10 dBw 
Receiver noise temperature, 7’, 150°K 


Receiver noise bandwidth 350 MHz 
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Carrier-to-Noise Ratio Up link at 30 GHz 
Net path loss Pp/Pp = Xd?/ArpAR 78.6 dB 
ca system loss, including filters 1 dB 
Receiver power, Pp = Pp — 80.6 dB —70.6 dBw 
Satellite receiver noise —112 
C/N at satellite 41.4 
C/I for ground stations 264 miles apart 39 dB 
C/N + I at satellite 37.0 

Down link at 20 GHz 

Net path loss Pr/Pp =) d?/ApAr 82.1 dB 
OR aer ea system loss, including filters 0.5 dB 
Received power, Pp = Pr — 83.1 dB —80.1 
Ground station receiver noise (no rain) —121.5 dBw 
C/N in ground receiver (no rain) 41.4 
C/I for stations 275 miles apart 33 
C/N + I (including up link) 31.1 
C/N + I for 10-7 error rate 20.0 a 
Margin (no rain attenuation) | +11.1 
C/N + I for 10™ error rate 17.0 
C/N + I (for 10 dB rain attenuation) 25.7 
Margin during moderate rain +8.7 


envelope which follows a fourth power law, interference at the nearest 
neighbor will be so small as to be negligible. 

A similar comment applies to interference at a ground receiver 
admitted by the sidelobe response of the 20 GHz 10 X 17 meter 
ground station antenna. Under conditions of normal propagation (no 
rain) the system will be interference limited on both the up and down 
links. This is unusual in present day radio system design and comes 
about because we are attempting to get the maximum possible com- 
munication capacity. Any further improvement will require better 
antennas. Notice also that the down link margin for a 10° error rate 
is +8.7 dB during a rainstorm which produces 10 dB of attenuation.* 
For a similar, but independent fade on the up link the margin is 14 dB. 

Another important feature of the model satellite system is the fact 
that a given repeater can be seen from many areas in the United 
States. This makes possible a flexibility not found in terrestrial links 

* Calculation of error rates corresponding to the stated ratios of C/(N + I) 


indicates much smaller rates than those given in Table I. The stated larger values 
allow for imperfection in a high bit rate PCM receiver. 
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where standby or alternate facilities must be provided link by link. 
In the satellite case we have, at least in principle, the possibility of 
making good a fault in one part of the network with capacity nor- 
mally assigned to a distant link and temporarily unused because of 
time zone or other differences. 

On the other hand, if we adopt the now generally accepted rule 
that only one 24-hour synchronous satellite link is allowed per circuit, 
this ubiquitousness of satellite system ground terminals is not just a 
convenience, it is a necessity. If a satellite system is to carry the bulk 
of the domestic traffic, we require not just circuits from outlying sta- 
tions into a network node, but circuits which go directly from every 
place to every other place; thus we require many ground stations 
for complete coverage. The fact that channels from every ground sta- 
tion pass through each repeater makes available in one place channels 
serving widely separated parts of the country and provides us with 
the opportunity to reroute channels on board a satellite in accordance 
with changing traffic patterns. 

To accomplish this, we must provide a suitable switching matrix; 
this is most conveniently implemented at the intermediate frequency 
which can be common to all repeaters, and hence channels can be 
interchanged as desired upon command from the ground. This 
flexibility also makes it possible to make good a limited amount of 
failed apparatus on board the satellite. 


III. PROPAGATION STUDIES 


Since electromagnetic waves with frequencies above 10 GHz are 
severely attenuated by liquid water, satellite systems using these 
frequencies must be designed to tolerate a few dB of attenuation which 
may be experienced for long periods and to switch to a diversity | 
ground station on those rare occasions when a very large attenuation 
is caused by excessive rain. Also the diversity ground station must be 
far enough removed so that there is little likelihood that an intense 
rainstorm will cover both ground stations at the same time. 

If the attenuation statistics on the two paths are uncorrelated and 
enough margin is provided to make both outages small, the system 
outage time can be reduced to a satisfactory degree. In general, this 
requires a more detailed and a more quantitative knowledge of rain- 
fall and attenuation caused by rain than presently exists. Accordingly, 
measurements have been undertaken at Bell Telephone Laboratories 
to determine for terrestrial paths and for ground-to-satellite paths, 
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the constraints on system design imposed by this natural phenomenon.* 
These programs are discussed elsewhere; only a summary is attempted 
here. 

Measurements of rain made at the earth’s surface are not sufficient 
to determine attenuation along a satellite-to-ground path, but they 
can indicate the amount of structure characteristic of a typical rain- 
storm. A sample contour map of instantaneous rain rate as measured 
by the Crawford Hill rain gauge network is shown on Figure 1.? 
These data form a snapshot of a specific area at a given moment. It 






























































Fig. 1—Sample scan of Crawford Hill rain gauge network for August 25, 
1967 at 20:43 EST, contours in mm per hour. 


2118 THE BELL SYSTEM TECHNICAL JOURNAL, DECEMBER 1968 


was selected from many hours of data to illustrate one of the most 
intense rainstorms observed during a one-year period of measurement. 
The data illustrate that the most intense rain occurs in very limited 
cells and that rain which covers large areas (square miles) falls at the 
rate of one inch per hour or less. These are the considerations behind 
the proposal to use diversity ground stations separated by several 
miles. 

A more quantitative determination of attenuation statistics is being 
made with a radiometer which tracks the sun. This installation, which 
has been described by Wilson,? measures the noise received from the 
sun at 16 and 30 GHz and thus determines the attenuation caused by 
any intervening precipitation. Figure 2 is a photograph of the Craw- 
ford Hill sun tracker. At night the radiometer is pointed to a cold 
part of the sky and detects precipitation as an increase in effective 
temperature. Good accuracy is obtained to about 35 dB of attenua- 
tion, when tracking the sun, and to about 10 dB when the cold back- 
ground is used. Thus we are accumulating data which will be helpful 
in the design of domestic satellite systems, especially those operating 
above 10 GHz. 

Unfortunately, if one is to obtain a statistically meaningful result, 
operation must include at least a year in order to encompass all of the 





Fig. 2— Crawford Hill sun tracker. 
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iene 3— Recording by Crawford Hill sun tracker of shower on December 12, 
67. 


seasons. And this cannot be hurried. When we have obtained enough 
data to be worthy of statistical analysis, it will be published; data 
obtained over a shorter interval must necessarily be regarded as in- 
complete. With this qualification, we present the following data and 
observations which, though limited, are relevant: 


(2) Attenuation exceeding 30 dB at 30 GHz (about 10 dB at 16 
GHz) and lasting 6 to 8 minutes has ben observed on several occa- 
sions. Figure 3 is a strip chart recording showing such an occurrence 
along with a plot of the simultaneous attenuations at the two fre- 
quencies. Table II summarizes the data obtained for about 10 months 
through August 9, 1968. 

(it) These events are associated with intense rain cells (precipita- 
tion exceeding 100 millimeters per hour). 

(tit) Heavy overcast and even light drizzle produce little attenua- 
tion (less than 2 dB at 30 GHz). 

(tv) Attenuation by fog and snow is no problem at these frequencies. 

(v) Since a second sun tracker is not yet operating, we have no 
direct data on diversity advantage. However, data from our rain gauge 
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TasLE [J] — Data From Crawrorp Hitt SuN TRACKER 





Attenuation (dB) Percent of total observing time 
at 30 GHz* 3091 daylight hours 
>3 1.93 
>6 1.02 
>9 0.59 
>15 0.32 
>21 0.18 
>27 0.11 
>33 0.08 
at 16 GHzt 2520 daylight hours 
>1 1.29 
>2 0.75 
>3 0.49 
>5 0.28 
>7 0.19 
>9 0.14 
>11 0.12 
>13 0.08 


* Data from October 14, 1967 through August 9, 1968. 
+ Data from December 8, 1967 through August 9, 1968. 


network indicate that heavy rains are highly structured and are un- 
correlated at distances greater than a few miles. 

In addition, a variety of locations representative of all of those 
areas in which ground stations will be located should be investigated. 
The present sun tracker is located in New Jersey, which is only one 
of many possible locations, but which does have the advantage of 
being climatically intermediate between the extremes represented by 
parts of the gulf coast area and the western desert regions. 

Since the sun moves continuously only a small time is spent looking 
in a given direction; on the other hand, at least some data on many 
possible satellite paths are being obtained. At best, these data will 
leave many questions unanswered, but we will have some quantitative 
data on which to base our system designs. More locations, and par- 
ticularly the advantage obtained by using two or more diversity 
ground stations, must be studied. 

A beacon in synchronous orbit would be an ideal source for propa- 
gation experiments, but if it is to be worth the cost, a satellite experi- 
ment must have some advantage over a sun tracker. A stationary 
satellite with a highly coherent source of adequate power output 
would make possible simpler and therefore cheaper ground stations. 
Because it would be stationary, the satellite would not require precise 
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tracking as does the sun. Tracking the sun is simple but does com- 
plicate the installation and costs money. Use of a satellite would make 
possible more observation points for the same total ground station 
cost and thus enable us to gather design data more quickly. A highly 
coherent source, in contrast to the thermal noise emitted by the sun, 
would ease the measurement of the phase coherence of received signals 
over very large bandwidths. This is important because the system 
model requires very broad bandwidths (300 to 400 MHz) and cor- 
respondingly short pulses. 

Direct evidence that satellite radio channels will support such 
transmission is scarce and more is needed. Considerable indirect evi- 
dence is available, however. A relatively short (12.5 km) terrestrial 
path has been studied at 12 GHz for over a year with only one in- 
stance of moderate multipath activity; a second path 6.5 km long 
was also observed at 18 GHz for more than a year with no evidence of 
multipath fading. Very large narrow beam antennas have been used 
for radio astronomy at 30 GHz with no significant beam broadening 
observed.® Measurement with an array at 30 GHz has shown that the 
phase front of the incoming wave is not badly distorted.® 

Scattering from refractive index inhomogeneities in the atmosphere 
could conceivably introduce time delays which would restrict band- 
width, but this same phenomena would broaden antenna beams and 
reduce antenna gain. Probably the most sensitive indicators are the 
wide base interferometers used for radio astronomy; some have ob- 
tained resolutions of a fraction of a second of arc. All of the above 
applies to relatively high elevation angles, say, above 15°. It seems 
certain that all of the difficulties associated with long terrestrial paths 
will be experienced as the satellite path approaches grazing incidence. 
The problems which remains is to make this relationship quantitative; 
this would be most easily done using a highly coherent source on board 
a satellite. However, if we are to obtain data useful for system design, 
the satellite must be continuously available to insure that we do not 
miss any data, and it must be available over a long enough period of 
time to insure that statistics derived from the data are reasonably 
stable, that is, seasonal and other variations are included. 


IV. INTERFERENCE CONSIDERATIONS 


A radio system designed for maximum efficiency will be interfer- 
ence limited. This is true because doing so makes best use of frequency 
space, geometry (which in the case of a synchronous satellite system 
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includes orbit space), and the investment in facilities. An interference 
limited system is the end result of trying to get the maximum possible 
communication with a given limited resource. We assume that the 
required technology has reached the state where thermal noise can be 
reduced to a small value compared with interference at least during 
normal propagation conditions. Calculations of carrier-to-noise ratios 
made in this paper (see Table I) verify that such an assumption is 
warranted. 


4.1 Antenna Patterns 


The statement that a radio system is interference limited is equiv- 
alent to saying that the radiation patterns of the antennas used are 
of crucial importance; hence, we start with these. As in other areas, 
we look at current practice, estimate what may be possible in the 
future, and try to land somewhere between. 

Patterns for an antenna of current design are compared with 
greatly simplified relationships in Fig. 4. We see that the microwave 
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Fig. 4—Side lobe levels of microwave pole line periscope antenna. Beam- 
width = 2.5°, 6. = 1.25°. Vertical lines show range of envelope of side lobe levels 
for both polarizations. 
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pole line antenna* is almost everywhere better than F(6) = [1 + 
(0/89) ?->]-1, where 

6 = angle off the peak of main lobe 

6 = 4of the 3 dB beamwidth 


F(6) = antenna response (power) 


which we understand has sometimes been used in studies by the 
International Consultative Committee on Radio. In fact, the micro- 
wave pole line antenna follows the (0/8)* curve down to about 
—25 dB. On Fig. 5 we have plotted theoretical response curves for 
antenna apertures with specified variations in the amplitude of the 
illumination together with a uniform phase. We see that the enve- 
lope of a cos® [(z/2)x], where N = 1, distribution equals or exceeds 
the discrimination predicted by the (0/0))* curve and that for all 
higher values of N the (6/0)* curve is greatly exceeded. But, of 
course, these curves are not for real antennas. However, in Fig. 6 we 
see the envelope of the pattern of a horn-reflector antenna using a 
special dual mode feed designed by R. H. Turrin.” In this case the 
pattern discrimination provided by a real antenna considerably ex- 
ceeds the (0/6))* curve for all levels of discrimination and both 
polarizations.} 

For our present calculations we use the relationship, F(6) = [1 + 
(0/8)*]-+, with the expectation that a multibeam antenna with a 
reasonable amount of illumination taper designed to reduce or elim- 
inate aperture blockage by the feeds will be able to do at least this 
well. We think that the data presented above makes this a reasonable 
assumption. 


4.2 Interference at Satellite 


Our system model assumes that each ground station communicates 
with each satellite and vice versa. If we consider the situation at the 
satellite antenna feed corresponding to a desired ground station, we 
see that it will receive power from the desired station through its main 
lobe and from all other ground stations through its side lobes. How- 
ever, with a fourth power variation in antenna response, the nearest 
neighbors will contribute most of the interference. Thus if we visualize 

* This is a well-shielded antenna designed for a specific system. It was sug- 
gested by A. B. Crawford and measured by R. H. Turrin. (Unpublished work). 


+ The problem which remains in this case is that no one knows how to build 
a multibeam horn-reflector antenna. 
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a rectangular grid of uniformly spaced ground stations, there will be 
stations stretching out to the north, south, east, and west with spac- 
ings 1, 2, 38, ... times unit distance. The distance from the desired 
ground station to another ground station having coordinates (n, k) 
will be (n? + k?)%; hence, the total interference is 


Fe) = > E + (2) 


nk 












































0.001 189 dB 





Fig. 5— Calculated antenna pattern for cos"[(7/2)z] illumination of square 
aperture. a is aperture opening. @ is angle with normal to aperture. u = (7/A) 
a sin 6. 
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Fig. 6— Comparison of measured antenna discrimination with assumed value. 


When, as in the present case 


(0,,/0)* >> 1, and 0. = SQ +H) 


where 
S = uniform spacing between ground stations 
R = distance to the satellite 
n,k, = 1,2,3,---. 

Then 


ro - £[(@)T - E[G ee") 


Oye ee nh gut a | 
ager 

R R 
Hence, the total interference caused by all of these interfering ground 
stations will be 5.94 times or about 7.7 dB up on the interference 
produced by a single nearest neighbor. 

As applied to the present case and as assumed for the calculations 
made in support of Table I where a C/I ratio of 39 dB (net) at the 
satellite was used, a distance of 264 miles with a 10 meter satellite 
antenna was obtained. These computations are (see Fig. 7): 


F(8) = Ay or F(6)7? (22) 
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For C/I = 39 dB (all sources), 39 dB + 7.7 dB = 46.7 dB (one 
source) thus 


4 
(42) =10°7, 22 = 19 = 150. 
0 0 


For up link f = 30 GHz, ) = 1 cm, and assuming a 10 meter satel- 
lite antenna: 





dr 1 cm 
20) = 1.22 = 1.22{ 5 | 
20, = 1.47 X 10°° rad. 
6, = 0.735 X 107° rad. 


Aé = 15.08 = 11.0 * 107° 
S = R(A6) = 24 X 10°(11.0)10° 
S = 264 miles. 


4.3 Interference at a Ground Station 


Again we visualize a grid of uniformly spaced ground stations, 
each of which communicates with every satellite.* Thus a given 
ground station will be surrounded by beams from the satellite to all 
of the other ground stations, and the minor lobes associated with 
these satellite antenna feeds will crosstalk directly into the main 
beam of the ground station since it is “looking” at the desired satellite. 
As before, with a fourth power variation in antenna discrimination 
versus angle, most of the interference will come from the nearest 
neighbors. The analysis for this case follows the previous one for 
the interference at a satellite with nearly the same result except 
that at the longer wavelength (20 GHz) there is slightly less antenna 
discrimination. As applied to the present example—a 10 meter antenna 
on a Saturn launched satellite—we obtain the required spacing for 
the ground stations of 275 miles, as follows (see Fig. 8): 


F(6) = [1 i (aey']” or F(@)"? (22). 


* Obviously this is an idealized model. Even if the ground stations were 
located in a regular pattern, which is not at all likely, the pattern would appear 
distorted to the satellite. An accurate calculation must be based on actual loca- 
tions of ground stations which are at present unknown and north-south spacings 
will depend on station latitude; the idealized model is adequate for our purpose. 
No advantage has been taken of polarization. This is deliberate since a quantita- 
tive evaluation is not available at this time. We leave any polarization advantage 
for use in solving unforeseen problems. 
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Fig. 7 — Interference at a satellite. 


For C/I = 33 dB net, 33 dB + 7.7 dB = 40.7 dB for one source, thus 


4 
(42) = 19%", (42) = 10°? = 10.4. 
0 i) 


For the down link f = 20 GHz, \ = 1.5 cm, and assuming a 10 meter 
satellite antenna: 


op = {99S 1.22(25 om) 


a 830 em 
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Fig. 8 — Interference at a ground station. 
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6) = 1.10 X 10°’ rad. 

Ad = 10.46 = 11.45 X 10°* rad. 

S = R(A6) = 24 X 10°(11.45 X 107°) 
S = 275 miles. 


I 


4.4 Modulation Techniques 


The use of different look angles to achieve discrimination by an- 
tenna pattern is one important means for achieving co-channel opera- 
tion in a limited geographical area. Another is choice of a modulation 
technique resistant to interference, particularly co-channel interfer- 
ence. The possibilities here are many, at least in theory. However, 
when, as in the present case, one is considering channels having bit 
rates of several hundred megabits per second, complicated encoding 
schemes—especially those which require appreciable storage—are 
not worth the cost and complication. In the present case, as is indi- 
cated in Table I, we have chosen four-level over binary PCM even 
though the latter is more resistant to interference.* 

While the increased resistance to interference provided by two 
level modulation is attractive, our studies indicate that the possibility 
to double the channel capacity in approximately the same bandwidth 
by using four levels is well worth its cost. It is conceivable that even 
more levels would be advantageous, but the return diminishes at 
such a rate that specific situations must be studied to be certain. 
These studies are being continued. 

One of the features of quantized modulation is that the signal can 
be regenerated at intermediate repeaters in which case any errors 
made in regeneration accumulate along the system, but thermal noise, 
interference from crosstalk, and imperfections in repeater character- 
istics do not. In the present satellite system configuration, the satel- 
lite-to-ground or “down-link” is the most difficult from both the 
standpoint of thermal noise and interference. This comes about be- 
cause we can use a powerful enough transmitter at the ground station 
to make thermal noise at the satellite receiver front-end negligible, 
and we can space the satellites far enough apart (1° in the present 
example) so that, with highly directive ground station antennas, 


* Buried in these statements are assumptions about the manner in which 
co-channel, adjacent frequency channel, and adjacent time slot interference 
impair a signal. These problems are being intensively studied and will be 
reported elsewhere. 
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interference at adjacent satellites is small. Under these circumstances 
it is not worthwhile to regenerate on board the satellite. 

However, if in the future synchronous orbit space becomes so 
crowded that we are forced to tolerate the same interference on the 
up-path as on the down-path, regeneration will be very much worth- 
while. If we regenerate, the error rate would increase from, say, 1 
in 10’ to 2 in 10’ for both links, but if we do not regenerate, the error 
rate would increase from 1 in 10% to 1 in 10. However, under actual 
operating conditions the vagaries of propagation will affect the re- 
ceived signal level and under some conditions can adversely affect 
the signal-to-interference ratio. Thus an actual system must be de- 
signed with some margin for changes in signal level, and even a few 
dB of margin will make regeneration on board the satellite much 
less profitable. Regeneration at each intermediate repeater is most 
useful with very stable media such as coaxial cable or waveguide, 
where changes in transmission loss are small and systems can be de- 
signed to operate continuously at the specified error rate. 


V. THE SATELLITE 


Satellites of the size and complexity envisaged here have never been 
built, but there is no particular reason why they cannot be. Once 
launched, the environment in which the repeater must operate is in 
many ways more benign than here on earth. In the early days, our 
ignorance of the space environment clouded our vision and obscured 
this fact. But experience over the past several years with communica- 
tion and other satellites has encouraged consideration of larger and 
more sophisticated spacecraft. Reliability remains a problem, but 
this is not peculiar to spacecraft since present day terrestrial com- 
munication equipment must also be designed to be maintenance free 
for up to ten years because of cost. 

Design of a radio repeater for satellite use has much in common 
with design of its terrestrial counterpart. This is particularly true 
where, as in this case, such large high-gain antennas are used on 
the spacecraft and at the ground station that the loss on a satellite 
path is not much larger than is encountered on typical terrestrial 
paths. Using this approach, a satellite repeater can be of rather con- 
ventional design, but with special attention to efficiency in the use 
of de power and reliability. Integrated microwave circuits and solid- 
state devices would be used throughout except for the traveling wave 
tube final RF amplifier, 
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A repeater of this type is estimated to weigh 314 pounds and to 
require 18 watts of unregulated power from the solar supply for an 
RF output of 2 watts per channel. To this must be added 2 pounds 
per ground station (8 RF channels) to allow for diplexers and chan- 
nel dropping and recombining networks. In addition the pro rata 
share of power supply weight will be at the rate of 2 watts per pound 
as discussed in Section 5.2 for a total of 9 pounds for 18 watts to 
power one transponder. Thus 8 transponders to serve one ground 
station will weigh 8 (314 pounds) for electronics + 8 (9 pounds) for 
power + 2 pounds for filters = 100 pounds. 


5.1 Stabilization 


When very narrow antenna beams are used, as in this design, precise 
stabilization of the spacecraft is crucial. The first task which must be 
accomplished is sensing the attitude of the spacecraft with a high degree 
of accuracy and reliability. In this discussion we assume that crude 
orientation, that is, within a degree or so about all three axes, is ac- 
complished by present methods. More precise sensing for final attitude 
adjustment to +0.01° (0.2 X 107° radians) can be achieved by radio- 
metric means. For example, at the upper frequency of 30 GHz, antennas 
spaced a distance a = 10 meters will receive signals differing in phase 
by w radians when the platform tilt ¢ = \/2a = 107° meters/20 meters 
= 5 X 10° radians. A comparison circuit should be able to detect a 
difference of 7/100 radians or a tilt angle of 5 X 10~° radians. This is 
about two orders of magnitude smaller than the allowable variation 
and should be adequate for precise attitude control.* 

If we use one-meter antennas on the spacecraft for this purpose, 
reduce the receiver bandwidth to 400 kHz, and keep the ground 
transmitter power unchanged at 10 watts, the carrier-to-noise ratio 
in this sensing channel will be 50 dB. This is more than enough, but 
the consequences of losing attitude control are severe and plenty 
of margin is in order. Another basic aspect of attitude control is 
the amount of fuel (weight) required to stabilize a spacecraft for 
a long period, say, ten years. 

This phase of the problem has been studied from two complementary 
points of view: (7) a study of the basic effects which perturb the 
satellite orbit? and (i) a rudimentary station keeping and attitude 
control subsystem design. The first study indicates that an impulse 





* An ingenious means for keeping most of the complication of the attitude 
sensing and control system on the ground has been given by C. C. Cutler in 
Ref. 8. 
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of 157 feet per second per year would correct for changes in the 
inclination of the orbit of 0.86° per year caused by lunar and solar 
gravitation plus the lesser effect of a westward acceleration of 1.68 X 
10-* degrees per day resulting from the nonsphericity of the earth’s 
gravitational field. Attitude control, even to the very precise limits 
assumed here, appears to be primarily a question of the reliability 
of components to be used in a system with a design lifetime of many 
years. 

In comparison with the amount of fuel required for station keep- 
ing, the weight of fuel required for attitude control is negligible, 
that is, a few pounds per year per ton of satellite weight. Thus 
in the design study it was assumed that an impulse of 160 feet per 
second per year would be required for station keeping. Our conclusion 
from these studies is that the weight penalty imposed on a satellite 
having a ten year operational lifetime while significant is not pro- 
hibitive. 

The very difficult engineering problems which remain to be solved 
concern reliability in the space environment. We estimate that the 
hardware plus fuel required to correct an initial velocity error of 
100 meters per second, keep the satellite on station and stabilize its 
attitude for ten years invokes a weight penalty of about 16 percent 
of the total weight of the spacecraft. Thus Wtota1 = Wopaytoaa + 0.164 
Weotar OF Weotar = 1.20 Woaytoaa, and this is the relationship used here 
to calculate the total spacecraft weight once the weight of the com- 
munication equipment, solar power supply, and other necessary com- 
ponents has been determined. 


5.2 Solar Power Plant 

Values of 15 to 20 watts per pound have been reported for sun- 
oriented solar cell arrays, but 10 watts per pound is closer to present 
performance. We use two watts per pound. This is reasonable for an 
unoriented array and very conservative if, in fact, it is decided to 
point the solar power plant at the sun. Since a single two watt trans- 
ponder will require nearly 18 watts total solar power, power for each 
transponder will require a weight of nearly 9 pounds as its pro rata 
share of the solar power plant weight. 


5.3 Structure and Antenna 

An allowance of 20 percent of the total payload weight will be 
assumed for satellite structure and a like amount for the associated 
communications antenna together with its feeds. Admittedly this is a 
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guess based on previous experience and would require an innovative 
design to realize a lightweight antenna operable at 30 to 40 GHz 
and able to survive the launch environment. However, once the 
satellite is in orbit and on station, and this is the only condition 
under which it would be used, the disturbing forces will be very small. 

As already mentioned, an attitude and orbit control system capable 
of keeping a satellite on station and properly oriented for 10 years 
will require a total in-orbit weight of Weotar = 1.20 Wpaytoaa. Thus, the 
payload weight required for eight broadband channels to serve one 
ground terminal is 


Woaytoaa = 100 Ibs. for eight two-watt transponders 
+ 0.2 Woayicaa for structure 
+ 0.2 Woayicaa for antenna. 


thus 


Woayloaa = 167 lbs. per ground station 
and 


W totat 


1.20 (167) = 200 lbs. per ground station. 


VI. EARTH STATIONS 


As already pointed out, a terrestrial terminal will consist of two 
or more diversity locations each equipped with a multibeam antenna 
suitable for both transmitting and receiving. The number of diversity 
locations required, and the distance between them will be determined 
by the specified system reliability and the characteristics of rain- 
storms in the particular location being considered. Much more exten- 
sive (geographically) and complete data on space diversity will need 
to be obtained before such installations can be designed with con- 
fidence, but data available from the Bell Laboratories rain gauge 
network and other sources?® make it reasonable to postulate that two 
sites separated by 10 to 20 miles will be adequate, and this is the 
assumption used in the following. Actually a study of data of the sort 
shown in Fig. 1 leads one to hope that a considerably smaller distance 
would suffice. 

Provision of the first spare transmission facility is no great hardship 
since this is needed in any case to provide for maintenance and for 
protection against apparatus failure, but the diversity site must be 
connected by a broadband facility adequate to carry all of the traffic 
received or transmitted by either ground station. In the model system 
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this will vary from about 80,000 voice channels for a station working 
with a single satellite up to about four million voice channels for 
one that works with 50 satellites. 

Since the most attractive transmission medium for such large vol- 
umes of traffic appears to be a millimeter waveguide of the type 
described by 8. E. Miller, a facility of this sort is assumed as the 
site interconnecting link. A fully implemented terminal handling 
four million voice circuits would require eight pipes of two-inch 
inside diameter, each carrying 60 two-way 630 megabits per second 
channels. It is expected that for distances up to 15 or 20 miles between 
diversity ground stations no intermediate repeaters would be required 
when a low-loss waveguide is used. Concentration of the electronics 
at two sites which presumably would be attended at Icast part time 
should simplify maintenance and increase reliability. This site inter- 
connecting link, with its large volume of traffic to be carried over a 
short distance and without a need to demultiplex the individual voice 
circuits at either end, is a good candidate for an optical transmission 
link when technology has progressed far enough to make such an 
installation feasible. 

The proposed ground station antenna is of the tyne being studied 
by A. B. Crawford and T. §. Chu.!? This is a multibeam antenna 
10 meters high by 17 meters long desisned for up to 10 feeds for 
use with a like number of satellites. The common reflector is so 
designed that working in conjunction with the multiple feeds, good 
antenna patterns can be maintained over a 10° change in the azimuth 
angle but with no provision for variation in declination angle. The 
latter is obtained by tilting the entire assembly to the declination 
angle appropriate to the particular site. 

As already mentioned, these systems will be interference limited, 
hence good antenna discrimination is more important than gain, and 
the antenna design takes advantage of this fact. In the long run elec- 
tronically steerable arrays may prove to be the best and most econom- 
ical solution to the multibeam antenna problem both on board the 
satellite and at the ground station. In fact, if crowding of the syn- 
chronous orbit eventually makes it imperative to use inclined 24-hour 
orbits as described by H. E. Rowe and A. A. Penzias, steerable arrays 
appear to be the only workable solution.** At the present time, how- 
ever, multiple feeds used with a focusing reflector appear to be the 
most attractive solution. 

As indicated on Table I, an over-all effective noise temperature 
of 150°K is assumed for the ground terminal receiver; this can best 
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be achieved by using a cooled parametric amplifier. Considerably 
lower effective temperatures are possible. But with an interference 
limited system operating at 20 GHz where the effective antenna tem- 
perature will approach 300°K under conditions of heavy rainfall, 
little improvement in actual system performance would be obtained. 
Although Table I specifies a 10 watt 30 GHz transmitter which 
provides more than 40 dB C/N at the satellite (and this may well be 
adequate), increased transmitter power is feasible and would not be 
very expensive if further propagation data makes this appear nec- 
essary. 


VII. LAUNCH CONSIDERATIONS 


The cost of placing a given weight in orbit can be obtained for 
many different combinations of upper and lower stages. Costs dif- 
fering by nearly 10 to 1 will be found, depending on the time, scale 
of production, size of launcher used, and so on. Such a wide variety 
of combinations of lower and upper stages including future exotic 
high energy upper stages and injection rockets are technically pos- 
sible (some are parts of on-going funded and scheduled programs 
while others exist only on paper), that large variations in load capac- 
ity and cost estimates are to be expected. 

The time scale being considered is an especially confusing factor; 
things are always predicted to be much better in the future. In 
thinking about the present system model, particularly satellite vs 
ground terminal size and complexity, it has been assumed that launch 
cost is directly proportional to payload weight. How realistic is this? 
Rocket capabilities tend to be quantized, and certainly one must pay 
the entire launch cost if it is made for his benefit. However, if we 
take a longer range view, a wide variety of combinations of lower 
and upper stages will become available. 

For example, the basic Titan 3-C will place 2,140 pounds in syn- 
chronous orbit, but various combinations of lower and upper stages 
based on Titan have been proposed which would place 650, 2140, 
2450, 3450, 5800, 9000, 18,000, and even 23,000 pounds in synchronous 
orbit. It is true that many of these combinations will require con- 
siderable research and development before a reliable launch vehicle 
results, but at least in concept it is possible to tailor the rocket to 
the job. In addition, we have the possibility, now amply demon- 
strated, of combining several small payloads into one for multiple 
launch. Combinations of these two techniques should eventually 
make it possible to match the rocket and its payload quite precisely. 
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VII. SUMMARY AND CONCLUSIONS 


It has been argued that by taking full advantage of modern space 
and communication technology and assuming that very broad fre- 
quency bands in the range between 15 and 40 GHz can be allocated 
to this service, realization of a domestic satellite system with con- 
siderable communication capacity is feasible within the next decade. 
Some of the parameters of satellites designed for such a system are 
given in Table III. 


Major unresolved technical problems are reliability of the propagation 
of these very short radio waves through the earth’s atmosphere and 
stabilization of a space platform to +0.01° for up to ten years. Data 
adequate to provide a solid statistical base for predicting system per- 
formance are not available, but a start has been made and more ex- 
periments are planned. Several programs are dedicated to improving 
performance of satellite attitude stabilization systems and achievement 
of the required value has been predicted. A preliminary study of the 
basic forces acting on a satellite shows that attitude stabilization even 
to these very precise limits does not require prohibitive amounts of 
fuel. However, a considerable advance in technology will be required 
before an attitude stabilization subsystem of adequate reliability and 
precision is available. The very broadband radio repeaters which are 
required to implement this concept are less of a problem since these are 
already being developed for other purposes and can be adapted for 
satellite use. 


TaBLE IT] — SuMMARY OF SATELLITE PARAMETERS 


Number of ground stations served 


8 16 32 50 

Weight of satellite (pounds) 1640 3280 6560 11,500 
DC power (kw) 1.15 2.30 4.60 7.20 
Transponders 64 128 196 400 
Gigabits through satellite 40 80 160 252 
One-way equivalent voice 

circuits through satellite 

(thousands) 640 1,280 1,960 4,000 


One-way equivalent TV 
channels through satellite 
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In addition to the problems peculiar to the present proposal, there 
are, of course, those common to all synchronous satellites: an ap- 
proximately %-second absolute round-trip delay and the attendant 
echo suppression requirement at a cost which is not negligible, slow 
delay variations resulting from path length changes which may 
amount to several microseconds, service outages caused by interrup- 
tion of solar power (or provision of storage batteries), and outages 
caused by sun transit, which will increase the 20-GHz receiver effec- 
tive noise temperature from the assumed 150°K to about 15,000°K 
(20 dB). 

Assuming a favorable outcome from the propagation measure- 
ments, particularly a demonstration that common carrier standards 
for reliability can be achieved with diversity ground stations, plan- 
ning for such a system could begin immediately. Other requirements 
of a nontechnical nature include resolution of the frequency alloca- 
tion problem and policy decisions regarding system ownership and 
operation. 
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Path Length Variation in a Synchronous 
Satellite Communications Link 


By ANTHONY G. LUBOWE 
(Manuscript received April 30, 1968) 


The path length and path length rate variations in a communications link 
connecting earth stations on the East and West coasts of the continental 
United States by means of an equatorial synchronous satellite are investi- 
gated numerically and shown to be less than 50,000 feet and 5 feet per second 
for periods of at least a day (and likely for a week or more) with no station- 
keeping during the periods of interest. 


I, PROBLEM 


The use of a synchronous satellite in a communications link with 
time division switching for digital channels may introduce problems 
because of the path length variations resulting from the perturbation 
of the satellite orbit. For instance, a path length variation of 0.02 
percent (10 miles for a typical satellite-tracker configuration) cor- 
responds to a timing change of 0.05 ms. The rate of change of path 
length (length rate) is also of interest. The object of this paper is not 
to apply the results to current problems, but to make the results 
available to prospective users. 

Analyses of the synchronous satellite problem’? have emphasized 
the long period motion. However, the relatively small short period 
perturbations have important effects upon path length and length 
rate. Also we expect the long period motion to be handled by discrete, 
rather than continuous, station keeping (perhaps by a sequence of im- 
pulsive maneuvers every few days) so that the major orbital perturba- 
tions will be those of short period. 

We assume an equatorial orbit, with the longitude fixed by the 
communication requirements at 100°W. We must specify the initial 
altitude and velocity so that the satellite is synchronous. If we assume 
that the earth’s inverse-square gravity results in a circular equatorial 
orbit with mean motion equal to the earth’s rotation rate, then it can 
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be shown that perturbative forces such as the Jo, J3, J, zonal har- 
monics and the Jo. tesseral harmonic introduce radial and tangential 
forcing terms (e€ and n) into the equations of motion.? 


II, ANALYSIS 


If the initial satellite longitude is equal to the longitude of the 
major axis of the earth’s elliptical equator, then the initial radius, 
7s, can be chosen so e and n are both equal to zero (if higher order 
terms are neglected). If the initial longitude must be different, then 
n = 0 cannot be obtained and there will be drift away from the de- 
sired longitude. However, we can still obtain e = 0 by proper choice 
of r, (with little effect upon n) and this is the procedure we follow. 

We do not include the luni-solar gravity perturbations (which we 
include in our computer runs) in the choice of initial conditions, since 
these perturbations are time-varying in an earth-fixed frame and the 
details of their inclusion depend upon the interval of interest.1 Their 
effect is small, and the complexity they introduce into an operational 
procedure probably outweighs any reduction in the perturbations. 


TABLE I—Patru LENGTH AND LENGTH RATE VARIATIONS 


Date Imax — ImMin . imax — Imi 

(feet) (feet per second) 
I-01:00 16,200 1.45 
1-21:16* 30,700 2.69 
4—01:00 27 ,000 2.58 
4-20:21* 32,400 2.88 
7-01:00 35 ,600 3.40 
7-18:05* 31,800 3.12 
8-16:12* 40,700 4,24 
9-14:19* 38 ,900 4.03 
10-01 :00 f 36,100 3.71 
10-08 :00 12,600 1.27 
10-14:04* 35 ,900 3.56 
10-15:00 41,300 4,22 
10-—22:00 12 ,900 1.14 
10-29:00f 38 ,400 38.85 
11-12:14* 45 ,200 4.15 
12-12:03* 29,700 2.75 
12-22:00 18 ,900 1.65 


* New moon 
+ Full moon on 9-29:17 
{ Full moon on 10-29:10 
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Fig. 1— Path length variation for eight days starting 10-15 :00. 


III. RESULTS 


The results were obtained using modifications of existing computer 
programs to integrate the equations of motion of the satellite numeri- 
eally.® 

The ground stations are located at 38°N latitude and at 75°W and 
125°W longitude. The initial satellite longitude is midway between 
the tracker longitudes. We list some results in Table I. In each case 
the satellite has been injected into orbit, or the orbit has been cor- 
rected, so that the radial forcing function is zero on the date shown. 
(The dates are given as “month-day:hour,” so that 1-21:16 means 
January 21 at 4 p.m., Greenwich time. All runs were made for 1966 
since simple luni-solar position routines were available for that year; 
however, they should be quite representative of the variations to be 
expected.) 
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Fig. 2— Path length rate variation for eight days starting 10-15 :00. 


The maximum path length and length rate variations differ con- 
siderably over the solar cycle but even more over a lunar cycle. This 
is reasonable since the solar perturbation on a synchronous satellite 
is only 1/4 of the lunar perturbation. Peak-to-peak variations in 
L and L were less than 50,000 feet and 5 feet per second in all cases. 
The maximum variations during the lunar cycle occur around new 
moons and to a slightly smaller degree around full moons, and in 
the Fall of the solar cycle. The length variations shown in Table I 
result from all satellite position components. We notice that radial, 
longitudinal, and latitudinal components were in all cases less than 
2,500 feet., 0.03°, and 0.005°, respectively, and often considerably less. 

To see the effect of omitting daily stationkeeping, the satellite was 
allowed to move uncorrected for eight days beginning on a day of 
high and one of low variation. (See Figs. 1 through 4.) An interesting 
and rather unexpected result is that in both cases, the maxima for 
the one week runs are the same as for those one day runs with the 
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larger maxima, whether they occurred at the beginning or end of the 
week. Thus, one might tolerate stationkeeping once per week rather 
than daily if the main orbital constraints were peak-to-peak varia- 
tions in L and & of less than 50,000 feet and 5 feet per second. 
Another orbital constraint might be a maximum allowable longi- 


tudinal drift, say AA < 0.1°. The drifts are approximately linear for 
the first few days, that is, for 


Oct. 15 to 16, AX = 0.026° 
Oct. 15 to 23, Ad = 0.199° 


and for 
Oct. 22 to 23, AX = 0.0023° 
Oct. 22 to 30, AX = 0.0250°. 


Thus a constraint of AA < 0.1°, might require stationkeeping every 
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Fig. 3 — Path length variation for eight days starting 10-22 :00. 
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Fig. 4 — Path length rate variation for eight days starting 10-22 :00. 


3 or 4 days in a period of large variations, although the path length 
and length rate constaints would not be violated for at least a week. 
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Parametric Representation of Ground 
Antennas for Communication 
Systems Studies* 


By D. L. POPE 
(Manuscript received April 9, 1968) 


Mathematical models relating the gain, cost, diameter, frequency, and 
rms surface tolerance of ground antennas are developed for both exposed and 
radome-enclosed parabolic reflectors. Diameters considered range from 16 to 
500 feet, while frequencies vary from 1 to 100 GHz. Data from existing 
installations are used to develop standard cost vs diameter and rms surface 
tolerance vs diameter relationships. The standard cost 1s associated not only 
with diameter, but also with standard surface tolerance. Quality factors are 
introduced to relate deviations from the standard rms surface tolerances to 
expected departures from the standard cost curve. The models are completed 
by the inclusion of an approximate relation for the gain of parabolic 
reflectors. Isach model comprises two equations among five variables. 
Although they are relatively complex, these models should be taluable to the 
communication systems planner in considering the gross features of alterna- 
tive concepts for ground antenna installulions. They are intended as 
guidelines for the conceptual stages of communication systems development, 
and are especially useful in terms of the trade-off studies they encompass. 
Three examples dealing with typical questions of this type illustrate their 
use. 


I. INTRODUCTION 


Mathematical models relating five important variables encountered 
in the consideration of ground antennas for communication systems 
are developed for both exposed and radome-sheltered structures. The 
variables are: diameter, cost, gain, frequency, and rms surface toler- 
anee. Often, in communication systems studies, the costs associated 

* A preliminary version, “Trade-off Models for Ground Antennas in Microwave 


and Millimeter Wave Communication Systems,” was presented at the Fifth 
Space Congress, Cocoa Beach, Florida, March 1968. 
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with a ground antenna are estimated by using a simple relationship 
based on antenna diameter alone. The results of our study have a 
similar utility, but are substantially broader in scope and have greater 
flexibility. They represent some of the major features of ground 
antennas for use in the preliminary phases of communication systems 
planning. 

In spite of the refinements offered by the present models in com- 
parison with previously available guidelines, parameters which are 
important in the design and operation of a specific ground antenna 
system are not included. For example, there is no provision for in- 
cluding antenna noise temperature. Hence, the comparison of con- 
cepts on the basis of signal-to-noise ratio is not possible. Similarly, the 
effect of aperture illumination and the side lobe levels to be expected 
are not considered. There are other factors which must be accounted 
for in the design process but do not appear in the present formuation. 
For any specific application, performance requirements are carefully 
defined and vary considerably depending on the intended use of the 
antenna. It remains a challenge to the antenna specialist to optimize 
his design in order to meet those performance requirements in the best 
possible way. The present models, which characterize antennas in 
terms of only a few of their gross features, cannot and should not be 
expected to apply at such levels of refinement. 

In this study, diameters range from 15 to 250 feet for the exposed 
antennas and 30 to 500 feet for antennas enclosed by a radome. Fre- 
quencies vary from 1 to 100 GHz. Only conventional reflectors are in- 
cluded. No consideration is given to actively controlled surfaces, 
multiple antenna synthetic apertures, or other such concepts which 
may be important in the future. 

The first relation introduced is Ruze’s formula relating gain to 
diameter, frequency, and rms surface tolerance. Two additional equa- 
tions which relate a standard rms surface tolerance to diameter and 
a standard cost to diameter are developed from information available 
on existing and proposed antenna installations. Both the standard rms 
and the standard cost relationships are based on the same specific data 
points within each class of antennas. The points used are believed to 
be consistent and span the diameter range of interest. The result of 
this interpretation is a standard cost relationship which depends not 
only on the diameter of the antenna, but also is related to its surface’ 
tolerance. 

The specific correlation of cost, diameter and rms surface toler- 
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ance is a novel feature of the present approach. Finally, quality fac- 
tors are introduced to associate departures from the standard rms sur- 
face tolerance with departures from the standard cost curve. The 
functional relations chosen to represent these factors are justified with 
largely heuristic arguments because the existing information does not 
permit a more precise determination. However, when the data that is 
available on cost and rms surface tolerance is adjusted to a common 
standard using the functions chosen, the resulting agreement is en- 
couraging. 

Each model consists of two equations among the five variables of 
interest. Although much more complex than the simple power law 
relationship often used to represent the cost vs diameter of ground 
antennas, these sets of equations contain considerably more informa- 
tion. Not only do they readily yield information about any specific 
case, but also they provide a starting point for various optimization 
studies. Three examples are given, dealing with maximum cost-ef- 
fective antennas, minimum cost antennas given the gain and the fre- 
quency, and the variation of gain for a specified cost and frequency. 
The first two examples are examined for both exposed and enclosed 
antennas. These examples suggest others that also could be done. 


II. THE PROBLEM 


The most elusive part of this study was determining satisfactory 
rms surface tolerance vs diameter and cost vs diameter relationships 
for ground antennas. Many reports were studied and several personal 
contacts made in the attempt to assemble sufficient information to 
draw the necessary conclusions. In spite of this effort, the functional 
relations which are ultimately suggested to represent standard cost 
and standard rms surface tolerance variations with diameter remain 
substantially empirical. It is worthwhile to discuss some of the rea- 
sons. 

The original hope was to process the available data on existing and 
proposed antenna structures by some statistically satisfactory tech- 
nique in order to determine the most likely functional forms for the 
relations of interest. This approach was abandoned for two reasons. 
In the first place, the number of antennas for which there is reliable 
information available on cost and rms surface tolerance is too small 
to admit a satisfactory statistical treatment. Secondly, cost and rms 
surface tolerance are vague and hard-to-define concepts which admit 
differing interpretations in each case. The lack of any common stand- 
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ards for reporting these quantities makes it unrealistic to determine 
functional relations involving such quantities by formal manipula- 
tions of the available data. 

It is not hard to understand the reasons for the ambiguities in the 
reported data. The problems involved in measuring the surface toler- 
ance of a large paraboloidal reflector are difficult, and to perform 
such a measurement is expensive and time-consuming. Often user 
demands on high performance antennas are sufficient to prohibit meas- 
uring the surface tolerance in any sort of a statistically satisfactory 
way. The measurement techniques themselves can place constraints 
on the structure, such as zero zenith angle and benign environmental 
conditions, which are unrealistic in terms of operational require- 
ments. The data reduction process to determine the tolerance figure 
reported may introduce extraneous variables or unsuspected biases 
influencing the result. In a few cases, the surface accuracy has been 
calculated by measuring the gain over a range of frequencies and 
then using Ruze’s gain equation, in which the rms surface tolerance 
is assumed to be the only unknown. This seems to be a powerful and 
effective technique, but it supposes a knowledge of the aperture ef- 
ficiency at each frequency, a quantity that is extremely difficult to 
determine independently.* 

Further ambiguity is introduced by neglecting to define the im- 
portant concepts carefully. The rms surface tolerance can be measured 
with respect to the best-fit paraboloid or to the original design con- 
tour. It often contains a systematic as well as a random component, 
which may or may not have been eliminated in the published value. 
It can, of course, be a deviation normal to the reflector surface or 
normal to the aperture plane, although the distinction is not especially 
significant for shallow reflectors. In a few cases, it is the maximum 
peak-to-peak deviations that are reported and an equivalent rms sur- 
face tolerance must somehow be found. 

The resolution of the cost associated with existing and proposed 
antennas, while not encumbered with the technical problems of sur- 
face tolerance determination, is beset with other difficulties. A high 
performance antenna is a custom-made item. The price reflects neces- 
sary research and development, engineering, tooling, and fabrication 
costs which are difficult to determine precisely and which cannot be 
spread over a large number of units. The requirements of each situa- 
tion must be dealt with separately. There are relatively few com- 
panies building such structures, and the competition is fierce, Pricing 
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information and guidelines are proprietary and are simply not avail- 
able to an interested outsider. 

A different sort of problem arises in the attempt to establish the 
costs of existing structures. A specific cost figure can generally be 
found for most of the antennas in operation today, but it is difficult 
to determine exactly what the reported number of dollars bought. 
There are numerous ancillary items with a ground antenna that may 
or may not be included: the electronics, feed structure, servo systems, 
data readout and transmission, power plants, land acquisition, sup- 
port buildings, heating, lighting, ventilation, and so on. Seldom is 
the reported cost broken down in sufficient detail. Meaningful cost 
comparisons cannot be made without knowing which subsystems are 
included in the reported cost and which are not. 

In view of such uncertainties regarding the costs and surface toler- 
ances of existing structures, and the relatively small numbers of such 
high performance antennas in operation, it is clear that a statistical 
approach to determining functional relations among the variables of 
interest would be illusory. Instead, the standard cost vs diameter and 
rms surface tolerance vs diameter relations are established by con- 
sidering only a few data points (3 for exposed antennas and 4 for 
antennas with a radome) which span the diameter range of interest 
and seem to form a consistent subset of the data available. 

The outcome of this line of reasoning is a pragmatic and qualitative 
model consisting of various relations among the variables of interest. 
It is not strictly defensible on grounds of statistical rigor, in spite of 
the rather satisfying way in which the available data are shown to 
fit within its structure. It is certainly neither unique nor absolute. Its 
usefulness lies in its reflection of acknowledged trends and its capacity 
as a basis for comparisons, trade-offs, and various sorts of optimiza- 
tion studies on ground antenna systems at a relatively coarse level. 


Ill. THE GAIN FORMULA 


In 1952, Ruze suggested a formula for the on-axis gain of a reflec- 
tor antenna.? This formula has been generally acclaimed and enjoys 
wide popularity in spite of the restrictive assumptions it incorporates. 
These assumptions were clearly restated by Ruze in his 1966 article.? 

Ruze’s formula states: 


2 2 
G= (7) exp — (2°) : (1a) 
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Here D is the reflector diameter, A is the wavelength at the frequency 
of interest, « is the rms deviation of the reflector surface from the best- 
fit paraboloid, and y is the aperture efficiency, a measure of the over- 
all electronic properties of the antenna. D, and e must be in con- 
sistent units. 

The leading factor in Ruze’s formula expresses the gain for a per- 
fect reflector. The effect of deviations from a perfect paraboloid are 
contained in the exponential factor. No distinction is made between 
manufacturing inaccuracies and deflections of the reflecting surface 
resulting from environment. The gain of a given antenna, with a 
specified diameter and surface tolerance, first increases as frequency 
is increased. However, a point is reached at which the exponential 
factor takes over, and a further increase in frequency results in a 
decrease of the gain. The point at which the gain is a maximum is 
called the gain-limit point. 

The same phenomenon can be noticed if the frequency is held fixed 
and the diameter is varied. The cause for a gain-limit point in diam- 
eter is not immediately apparent from equation la, but it occurs be- 
cause the rms surface tolerance is also a function of diameter. Stack 
has pointed this out in his work,’ and he gives curves of gain vs 
diameter for a number of frequencies.* 

The aperture efficiency, y, includes the effect of nonuniform illumi- 
nation, spill-over, aperture blockage, front-end losses in antenna elec- 
tronics and other factors which contribute to degradation in perform- 
ance. It specifically does not include the effects of an imperfect reflect- 
ing surface. For a properly engineered antenna, » should lie between 
0.65 and 0.75 (see Ref. 5). The aperture efficiency also depends to a 
certain extent on antenna configuration. In addition, the aperture 
efficiency depends on operating frequency for a given reflector. As a 
result of the uncertainty associated with the aperture efficiency, when 
equation la is used in this report, the aperture efficiency is simply 
taken to be 70 per cent. Refinements of this assumption would require 
information that is not available. 

Equation 1a requires modification in order to use it for antennas 
with a radome. Experience with operation of radome-enclosed anten- 
nas has been generally satisfactory at microwave frequencies. At 
such frequencies the radome is responsible for approximately 1 dB 
loss in gain mostly from aperture blockage.*® It also contributes to 
system noise temperature. The total system degradation depends to 
a considerable extent on local weather conditions. Consideration of 
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these effects is beyond the scope of this discussion. Further details can 
be found in Ref. 6. 

There is little experience with radomes at millimeter frequencies. 
In this regime, the radome thickness is no longer small with respect to 
wavelength and special design techniques will clearly be necessary to 
minimize losses. For the present, we assume that equation la can be 
modified appropriately by means of a multiplicative factor 


Ge roy o(®2) exp _(#) ] . (1b) 


The factor R(A) is chosen to represent the loss in gain caused by the 
presence of a radome, both because of aperture blockage and path losses 
in the radome. 

The gain calculated using Ruze’s formula (la or 1b) does not in- 
clude atmospheric effects that can degrade signal strength, such as 
turbulence or rain. These effects may be extremely important, parti- 
cularly at high frequencies, but are not explicitly part of the ground 
antenna considerations. 


IV. COST AND DIAMETER 


When the information available on the costs of ground antennas is 
plotted with the diameters, a substantial scatter of the data is evident. 
Even with logarithmic coordinates, the dispersion precludes a satis- 
factory straight-line fit to the data points over the entire diameter 
range. Such a straight-line fit in logarithmic coordinates would cor- 
respond to the familiar power law relation, cost = (constant) (diam- 
eter)”, A piecewise-linear cost function, corresponding to an increase in 
the power law exponent with diameter would be much better, but 
would introduce troublesome analytic complications. 

To establish the standard cost vs diameter relationship, we have 
selected three antenna structures which span the diameter range of 
interest, and have fit a three-parameter expression to these three data 
points. The antennas chosen are:* 

(t) The 15-foot antenna operated by Aerospace Corporation, El 
Segundo, California.” 

(wz) The 85-foot antenna operated by the Naval Research Labora- 
tory at Maryland Point, Maryland.’ 


* A typical reference is given for each antenna. There are several other sources 
describing each of the antennas. 
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(iwi) The 210-foot antenna operated by the Jet Propulsion Labora- 
tory at Goldstone, California.° 


All three antennas were manufactured by the same company, and 
all are exposed and fully steerable, although the first two have polar 
mounts and the third has an azimuth-elevation mount. Good rms sur- 
face tolerance information is available for all three. More importantly, 
the cost information obtained from user and manufacturer agrees 
reasonably well for all three structures. The costs cited include the 
structure, drives, and control, but do not include electronics, readout 
equipment, or other ancillary costs, insofar as could be determined. 
The standard cost-diameter relation obtained this way is 


$* = 6.7(10)°D7? exp (D/45). (2a) 


This curve is shown in Fig. 1. In equation 2a the diameter D is in feet. 
Potter’s power law curve’? for the 85- to 250-foot range is also shown 
in Fig. 1. 

Although the relation 2a fits the three selected points very nicely, 
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Fig. 1— Antenna cost vs antenna diameter for both exposed and radome en- 
closed antennas (standard curves). 
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Fig. 2— Radome cost vs radome diameter for rigid and air-supported radomes. 


problems can occur if unconscious extrapolation is attempted. Beyond 
210 feet, the costs increase very rapidly with diameter because of the 
exponential factor. There is a singularity at D = 0, and costs again 
increase as diameter decreases below 15 feet. The exponential increase 
in cost for very large antennas is probably not entirely unrealistic. 
However, relation 2a should be used only in a diameter range from 
15 to 250 feet. 

The cost-diameter relation for antennas with a radome is also shown 
in Fig. 1. In this case, the items included in, and excluded from, the 
reported cost are the same as for exposed antennas with one im- 
portant exception. The cost of the radome is included. A simple power- 
law relation seems to be satisfactory for these antennas over a 30- to 
500-foot diameter range. This relation is 


$* = 6.75(10)°D'**. (2b) 


Again, the diameter is expressed in feet. 
The estimated cost of the radome alone, including foundation and 
environmental control equipment, is shown in Fig. 2. Both air-sup- 
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ported and rigid space frame radomes are considered. This informa- 
tion is taken directly from Ref. 6 for the rigid radomes and from 
Ref. 11 for the air-supported radomes. Considerable extrapolation is 
required in both cases to cover the entire diameter range of interest. 
In addition, this data deals entirely with radomes designed to enclose 
antennas operating at relatively low frequencies. For the higher fre- 
quencies, special designs for the radome will have to be found in order 
to minimize losses and noise. Manufacturing and construction toler- 
ances will be substantially more stringent than those reflected in the 
prices represented by Fig. 2. Radome cost will clearly depend on the 
operating frequency as well as on diameter. However, since there is 
virtually no experience with high frequency radomes, even the ap- 
proximate nature of this dependence is unknown. In lieu of a more 
appropriate representation, the relations shown in Fig. 2 will be used 
in the present model. 

The diameter of the radome required to enclose an antenna of diam- 
eter D is assumed to be 4/3 D. The cost of the antenna alone can 
now be determined using both Figs. 1 and 2. 


V. SURFACE TOLERANCE AND DIAMETER 


The data available on rms surface tolerance of existing antennas 
is plotted in Fig. 3. The ranges shown with many of the data points 
are attributable to a number of factors. In a few cases, they reflect 
honest uncertainty. In others, they represent a range of reported values 
resulting from the different tolerances at different elevation angles, or 
under different environmental conditions. In some cases the range 
shown encompasses values reported from different sources for the 
same antenna. For one or two antennas, the range shown represents 
the design goal. 

The surface tolerance data shown represents the surface accuracy 
under operating conditions. Thus all factors that combine to produce 
mechanical deviations from a perfect paraboloid are included. These 
include manufacturing inaccuracies, as well as surface deflections 
caused by gravity, wind, and thermal effects. In general, the antennas 
represented are fully steerable, and operate satisfactorily in steady 
winds up to about 30 mph and other environmental conditions nor- 
mally expected for such antennas. 

The antennas used as a basis for the cost curves (Fig. 1) also de- 
termine the rms tolerance vs diameter curves. These curves are given 
by two straight lines; one for exposed antennas, and the other for 
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Fig. 3— Antenna rms surface tolerance vs antenna diameter for both exposed 
and radome enclosed antennas (standard curves). 


antennas operated inside a radome. The appropriate functional rela- 
tionship is of the form: 


ef = gD} (3) 
where ¢* is the standard rms surface tolerance in millimeters, and D 


is the reflector diameter in feet. Caution: take special note of this 
rather unusual juxtaposition of units. The constant a is; 


1.3(10)~* for exposed antennas, 
4.6(10)~* for antennas with a radome. 


a 


a 


The curve for reflectors protected by a radome has the same slope as 
the one for exposed structures but, at any given diameter, the surface 
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errors are considerably less for the enclosed structure because of the 
benign environment. 

Since the same points have been used to develop the standard ex- 
pressions, the cost-diameter relations (equations 2) give not just the 
cost of an antenna of diameter D, but specifically the cost of an 
antenna of diameter D with a surface tolerance given by equation 3. 
This correlation between the cost-diameter and rms-diameter curves 
is extremely important. The two relationships, taken together, express 
cost in terms of diameter and rms tolerance. The step from rms toler- 
ance to frequency is simple, thus the cost is implicitly related to diam- 
eter and frequency. The costs and rms surface tolerances defined by 
these curves are called the standard values, and are indicated by the 
symbols e* and $*, respectively. 


VI. QUALITY 


We now turn to a determination of the effects of departures from 
the standard curves. For example, how much can be saved by relax- 
ing the rms requirement at a given diameter? How much more will it 
cost to improve the surface tolerance at some given diameter? These 
questions are typical of many others, such as, is it better to increase 
diameter or surface tolerance to achieve desired performance?, and 
the possibilities of an interesting optimization study begin to emerge. 

To deal with these questions, we introduce quality factors to relate 
departures from standard rms surface tolerance to departures from 
the standard cost. This ingenious approach to the problem was first 
introduced by Stack.*}* The actual rms (e) and actual cost ($) can 
be expressed as 


e>= fie*, 
$ = f.$* 
where e* and $* represent the standard values as shown in Figs. 1 
and 3. For antennas with radomes, the cost appearing in these rela- 
tions is the cost of the antenna alone. The problem now reduces to 
an appropriate selection of the quality factors f, and fo. 
The functions chosen are 
f, = 1/a (4a) 
f. = exp (x — 1). (4b) 
The parameter x provides the connection between the quality factors. 
For x > 1, the actual rms surface error is less than the standard rms 
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error as given by equations 2. Conversely for x < 1, the surface is less 
precise than given by the standard relations. 


The range of the parameter is 0 S x S o. This range includes the 
possibility of achieving a nearly perfect reflecting surface by requiring 
x to be very large. Physically, of course, this is not possible. There exists 
a limiting tolerance, almost certainly a function of diameter, beyond 
which the surface accuracy can no longer be improved. Unfortunately, 
this limit is unknown. Equations 4 represent a compromise with this 
situation. Although equation 4a admits the possibility of infinite im- 
provement in rms surface tolerance, equation 4b associates an infinite 
cost with such an improvement. In fact, the cost factor f, expressed by 
equation 4b extracts a very heavy cost penalty for even modest rms 
surface tolerance improvement. In addition, expression 4b limits the 
possible reduction of cost to approximately 4 of the standard cost, 
regardless of the reduction of quality of the reflecting surface. 

The factor f; is the same as the one proposed by Stack.* However, 
the factor f. is significantly different. These factors are based on the 
realization that the standard curves of Figs. 1 and 3 represent very 
good reflecting surfaces indeed. It is reasonable to expect it to be ex- 
tremely expensive to improve the surface quality still further, while 
some saving should result if the standards of accuracy were relaxed. 
The particular factors chosen would probably not be applicable if we 
had selected the three basic antennas from which the standard cost 
curve is derived nearer the center of the spectrum of available pro- 
ducts. However, the three points actually used represent a definite bias 
toward the excellent, and this bias justifies the present choice of the 
quality factors. 

There is insufficient good raw data on the rms tolerance and cost 
of existing antennas to establish the quality factors directly. At least 
two reliable data points would be necessary for each of several differ- 
ent diameters in order to succeed. We would expect that the quality 
factors should also reflect the influence of diameter. However, the in- 
clusion of this effect could not be justified on the basis of the available 
information. 

It is possible to check approximately the quality factors chosen 
against the data that is available. Although the cost of each antenna 
considered is influenced by factors not included here (such as environ- 
mental considerations and tracking requirements), the comparison will 
be carried out on the basis of rms surface tolerance and diameter 
alone. A value of the parameter x can be determined by comparing 
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the actual and the standard rms values according to Fig. 3 and using 
the definition of f;. The appropriate cost factor fz can then be found 
from equation 4b. If the inverse of the calculated cost factor is ap- 
plied to the reported cost of the antenna, a revised standard cost is 
found. This figure represents the expected cost of the antenna, had it 
been built to the standard rms surface tolerance. The points obtained 
by performing this exercise for several different antennas are shown 
in Fig. 4. While agreement is not perfect, it is considerably better 
than any possible fit to the unmodified raw data. 

Of course, the quality factors given in equation 4 are not unique. 
Other functions can be found which provide the same sort of qualita- 
tive trends. Functions can be suggested which permit finite limits to be 
placed on the possible improvement or degradation of the rms surface 
tolerance, and finite limits can be established for the associated cost 
as well. Such functions are more complicated than those actually 
chosen, and the implications of their use have not been investigated. 
Presently available information simply does not permit a definite 
choice to be made among all the possible functions that might be 
appropriate. The decision was to accept a set of functions that were 
both qualitatively reasonable and analytically convenient, as given 
by equation 4. 
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Fig. 4 — Revised standard cost vs diameter for existing antennas. 
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VII. MATHEMATICAL MODELS 


Combining the equations developed above, we find the sets of equa- 
tions which can be used to study the interrelation of gain, cost, diam- 
eter, surface tolerance and frequency, for both exposed and radome- 
enclosed antennas. For exposed antennas: 


aD? 
e= Fp F (5a) 
$ = a,D™ exp (aD +2 — 1), (5b) 
G = nla,DQ) exp —(as5eM)”. (5c) 


Here, « is the rms surface tolerance in millimeters, D the antenna 
diameter in feet, G the gain in absolute units, $ the cost in dollars, y 
the aperture efficiency, and © the frequency in GHz, corresponding 
to a wavelength ». Appropriate values for the constants are: 


a =13X107%; a =6.7 X 10°; a, = 2.22 x 10°?; 


a, = 3.20; a; = 4.19 X 10°’; n = 0.70. 


Expressions 5 are appropriate for a diameter range of approximately 
15 to 250 feet. For antennas with a radome: 


= AD. (6a) 
$ = exp (x — D[s.D* — 6,D*] + 8,D" (6b) 
G = RO)[n(BsDO)” exp —(B7e2)”] (6c) 


The terms have the same meaning as for exposed antennas. In the 
present model, the cost factor is applied only to the cost of the 
antenna. The total cost, however, includes the cost of the radome. 
The expression for gain has been modified by the factor R(A), which 
accounts for losses resulting from the radome. The system effect of 
the noise temperature contribution from the radome is not considered. 
Appropriate values for the constants in equation 6 are: 


6, = 4.6 X 107°; B. = 6.75 X 10°; 83 = 1.30; 
Bs = 3.20; B, = 4.19 X 10°’; n = 0.70. 


For rigid radomes: 6, = 1.28 X 10°; 6; = 1.85 
For air-supported radomes: 6, = 1.69 X 107; 8; = 1.65. 


Expressions 6 are appropriate for a diameter range of approximately 
30 to 500 feet. 
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VITT. EXAMPLES 


Much information can be gleaned from the models expressed by 
equations 5 and 6. Each set is comprised of two expressions among 
five variables. (The parameter x can easily be eliminated between the 
first two equations of each set.) Thus if any three are specified the 
other two can be found directly. There are so many possible combina- 
tions that no general solution curves can be given. It is simpler to 
enter the appropriate equations for each specific case and work out the 
result. 

Of more interest are the various optimization studies that can be 
carried out. We give the results of three specific studies here. These 
are obviously not the only such studies that could be done. The details 
of the necessary algebraic manipulations are omitted, since they are 
generally straightforward but often tedious. However, in each case the 
procedure is indicated. The discussion is phrased in terms of equa- 
tion 5 for exposed antennas. However, the procedure described is also 
appropriate for equation 6, with obvious changes. 

For all examples which include a radome, a rigid radome is as- 
sumed. In addition the attenuation factor, R(X), is set equal to 0.7938. 
This corresponds to the assumption of a 1 dB loss caused by the 
radome, independent of frequency. While this is probably a reason- 
ably good number to use for low frequencies, it is certainly an over- 
simplication for the higher frequencies in the range of interest. Results 
displayed as a function of frequency for antennas with radomes in- 
corporate the implicit assumption that the radome is designed to 
match the operating frequency at each point. In other words, the re- 
sults do not indicate the performance of a specific system as fre- 
quency varies, but imply that the radome design also varies to match 
the operating frequency. 


8.1 Hxample 1: Maximum Cost-Effective Antenna 

A maximum cost-effective antenna is defined as one which pro- 
vides the most gain per dollar. By eliminating either the parameter 
x or the rms surface tolerance ¢« in the appropriate equations 5, both 
the cost and the gain can be expressed as functions of the diameter 
and the remaining variable, « or x. The ratio G/$ is formed, and 
maxima of this expression, considered as a function of two variables, 
are sought using standard techniques. G/$ exhibits a single maximum 
in the frequency range of interest, and the location of the maximum 
depends on the frequency, as expected. The results of this example are 
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plotted in Figs. 5 and 6 for exposed and radome-enclosed antennas, 
respectively. The ordinates represent the diameter and the cost, 
plotted against a common abscissa, frequency. The gain at the point 
of maximum cost-effectiveness is indicated on the diameter curve. 

There is a much greater variation of gain with frequency for ex- 
posed antennas than for antennas with radomes. This is directly at- 
tributable to the diameters involved. There is relatively little varia- 
tion in cost for exposed antennas over the entire frequency range of 
interest. Such antennas cost about $500,000 regardless of the operating 
frequency. It is also notable that the maximum cost-effective anten- 
nas found in this example all operate well below their gain-limit point. 


8.2 Kxample 2: Minimum Cost Antenna for a Specified Gain and Frequency 


In this example, the operating frequency and the gain are specified, 
perhaps as a consequence of other system constraints. The first step 
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Fig. 5—Cost and diameter of exposed maximum cost-effective antennas vs 
frequency (Example 1). 
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Fig. 6—-Cost and diameter of radome enclosed maximum cost-effective an- 
tennas vs frequency (Example 1). 


is to substitute equation 5a into equation 5c with G and © specified. 
The resulting expression can then be solved for the parameter x(D). 
This expression for x is inserted in equation 5b, yielding cost as a 
function of diameter. The diameter which minimizes this cost is then 
found by differentiation. The algebra involved in this example is un- 
pleasantly heavy and numerical search techniques were used to deter- 
mine the minimum cost diameter for both the exposed and the radome- 
enclosed cases. These results appear in Figs. 7 and 8, respectively. 
Figure 7a shows diameter vs frequency, and 7b gives cost vs fre- 
quency, for several different values of gain. The same pattern is fol- 
lowed in Fig. 8. 

For the radome-enclosed antennas, the results of this example are 
represented by straight lines in logarithmic coordinates. The diameter 
vs frequency relations for exposed antennas are also straight lines in 
logarithmic coordinates at the lower frequencies, but exhibit a definite 
curvature at higher frequencies, particularly for the lower gains. The 
cost vs. frequency curves for exposed antennas illustrate the excep- 
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Fig. 7— (a) Diameter and (b) cost of minimum cost exposed antennas for 
fixed gain and frequency vs frequency (Example 2). 
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Fig. 8—(a) Diameter and (b) cost of minimum cost radome enclosed an- 
tennas for fixed gain and frequency vs frequency (Example 2). 
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tionally high cost of gain at low frequencies because of the large diam- 
eter antennas required. 

An interesting comparison can be made between the costs of equiv- 
alent exposed and enclosed systems by using Figs. 7b and 8b. For a 
specified gain, there is a range of diameters (or frequencies) within 
which an exposed antenna is less expensive than one enclosed in a 
radome. This range varies with gain. For other diameters (or fre- 
quencies) the radome enclosed system is a better buy for a specified 
performance. 


8.3 Hxample 8: Gain as a Function of Diameter for Fixed Cost and 
Frequency 


In this example, we do not consider the optimization possibilities 
directly, but we are interested in the gain as a function of diameter 
when the cost and the frequency are specified. For a given cost, x can be 
expressed in terms of the diameter according to equation 5b. In solving 
the model this way, we discovered that it was possible for x to be negative 
for certain combinations of cost and diameter. To avoid such a meaning- 
less outcome, the restriction z = 0.1 was imposed in this example. This 
is equivalent to restricting attention to antennas with rms surface 
tolerance no worse than 10 times the standard value. With x(D) deter- 
mined, equation 5a establishes the rms surface tolerance and the gain 
can be found from equation 5c with no difficulty. 

Figure 9 shows the results of this exercise for two different fixed 
costs and several different frequencies. This figure shows the expected 
trends quite clearly. Notice that the optimum or gain-limit point for 
each set of conditions can be identified readily from the figure. This 
point could have been found directly, of course, by a procedure similar 
to that used in Example 2. 

This example was not carried out for antennas with a radome. 


IX. SUMMARY 


Mathematical models relating cost, diameter, gain, rms surface 
tolerance and frequency have been developed for both exposed anten- 
nas and antennas with a radome. The form of the model in each 
case is a set of two equations among the five variables of interest. 
This model is much more complicated than others that have been sug- 
gested to relate the cost and diameter of ground antennas. However, 
it is also considerably more general and can be used to study a variety 
of possible trade-off situations. The major features of these models are: 
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Fig. 9— Gain vs diameter at several different frequencies with a fixed cost 
of (a) $1,000,000, and (b) $10,000,000 (Example 3). 


(t) The inclusion of an exponential factor in the cost vs diameter 
relation for exposed antennas. This reflects, at least qualitatively, the 
exceptionally high cost associated with large, high performance ex- 
posed antennas. 

(71) The specific correlation of the cost vs diameter and rms sur- 
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face tolerance vs diameter relations. As a result, costs are associated 
not only with diameter, but also with rms surface tolerance. 

(w) The introduction of the quality factors. These factors relate 
deviations from the standard rms surface tolerance to expected de- 
partures from the standard cost curve. Although qualitative in nature, 
these factors reflect acknowledged trends. Skeptical readers who can- 
not accept the form of the quality factors used in the present models 
are encouraged to supply their own. 


The two equations comprising each model can be supplemented by 
additional relations among the variables such as an rms surface tol- 
erance-wavelength relation, for example. There is virtually no limit 
to the kinds of optimization studies and trade-off investigations that 
can be carried out within the framework of the suggested models. Ex- 
amples of three such studies have been included. Through the credi- 
bility of the results, these examples further demonstrate the qualita- 
tive validity of the models. 

These models have proved valuable in the preliminary phases of 
communications system planning, where competing concepts can be 
compared in terms of the relatively gross features of the system. 
They are not intended to usurp the responsibilities of the antenna 
designer in any specific application, and it would be erroneous to ex- 
trapolate their utility to such levels of refinement. Minor revisions in 
the constants of these models, as a result of new information or even 
different interpretations of present data, are to be expected and en- 
couraged. However, such refinements should not invalidate the gen- 
eral applicability of the present models nor the qualitative conclusions 
drawn from their use. 
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Radiating Properties of Dielectric 
Covered Apertures 


By ELLIOTT R. NAGELBERG 
(Manuscript received May 21, 1968) 


This paper studies the effect of a dielectric sheath on the far-zone radiation 
characteristics of an aperture bounded by a perfectly conducting ground 
plane. An examination of the appropriate Green’s function predicts a 
significant broadening of the radiation pattern in a narrow band centered 
about that frequency at which a surface wave would begin to propagate 
along a grounded dielectric slab with the same thickness and permittivity 
as the sheath. The differences between this phenomenon and what 1s com- 
monly referred to as a surface wave are discussed. Experimental results 
are then presented for the broadside and end-fire (perpendicular to the 
aperture) radiation by a waveguide aperture, from which it is found that 
the theory does predict the essential character of the observations. 


I. INTRODUCTION 


In designing phased arrays for radar and other types of commu- 
nication systems, it is generally necessary to provide some degree 
of environmental protection. One configuration which suggests itself 
is the flush-mounted “radome” (Fig. 1), made of a suitable refractory 
ceramic such as beryllia, alumina, or boron nitride. However, since 
these materials typically have relative dielectric constants greater than 
unity, such a covering sheath can be expected to influence the electrical 
performance of the antenna and hence of the system to which it 
belongs. The two antenna characteristics of greatest interest are: 
(z) the individual element input impedance and (22) the radiation 
pattern corresponding to a given aperture illumination. 

The effects of a covering sheath on the input characteristics of 
infinite rectangular waveguide arrays have been discussed by Galindo 
and Wu, using a technique of spectral analysis.t Although the radia- 
tion pattern of a single element in an array environment can also 
be obtained as a by-product of this analysis, it is still useful to study 
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Fig. 1— Segment of a phased array antenna with protective radome. 


a single isolated element, particularly in order to distinguish charac- 
teristics which are associated with the element itself from those at- 
tributable to the periodicity of the structure. If the individual elements 
are identical, then the far-field radiation pattern of the total array 
can be represened as the product of the element pattern (in the array 
environment so that mutual coupling effects are included) and the 
array factor, which depends on the periodicity of the aperture and 
the overall amplitude and phase taper. The sheath does not alter 
this separability feature and can be regarded as affecting only the 
element pattern. 

As an example, we have chosen to study the radiating properties of 
a single continuous aperture in a ground plane, covered by a dielec- 
tric slab of known thickness and permittivity. A study of the appro- 
priate Green’s function shows that the principal effect of the sheath 
is a significant broadening of the radiation pattern, over a narrow 
band centered about the frequency at which a surface wave would 
begin to propagate along the grounded dielectric sheath alone. (It 
follows that under matched conditions there must also be a dip in 
the broadside radiation, as required by energy conservation.) The 
result of this broadening is to produce a sharp increase in the power 
radiated near the end-fire direction. It should be pointed out, how- 
ever, that despite the mathematical connection, this radiation differs 
from a “surface wave”? in two respects: (7) It belongs to the con- 
tiuous spectrum of the radiation field and thus should be regarded 
physically as a distortion of the far-zone pattern rather than a sepa- 
rate mode of propagation; it thus exhibits the characteristic inverse 
power law variation with distance from the aperture. (27) The end-fire 
signal amplitude is roughly symmetrical about the surface wave cut- 
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off frequency and thus does not itself exhibit a cut-off characteristic. 

First we outline the formal solution to the boundary value prob- 
lem, which is carried out by a Fourier transform analysis. Then we 
obtain the far-zone field by a saddle point approximation to the 
inversion integral, from which the broadening is clearly shown by 
direct evaluation. Finally, we discuss the associated experimental 
work and present results which serve to corroborate the theoretical 
analysis. We use rationalized MKS units and the (suppressed) har- 
monic time dependence e** throughout 

There is a mathematical question regarding the validity of the saddle 
point approximation under the conditions @ ~ 7/2 and koa = (koa), . 
For this combination of parameters, the denominator of the function 
F(a) in equation (11) vanishes at the saddle point. Notice that the 
presence of the cos a factor causes the integrand to be finite under these 
circumstances so that the procedure is still valid. Another point of view 
is that when the saddle point and “pole” coalesce, the residue vanishes 
and with it the necessary correction term to the steepest descent formula 
(see Ref. 3, p. 508). 


II. THEORETICAL ANALYSIS 


The problem is to determine the far-zone radiating properties of 
an aperture covered by a dielectric sheath, as shown in Fig. 2. It is 
‘assumed that the electric field is in the y direction and that all quan- 
tities are independent of y. In addition to its mathematical simplicity, 
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Fig. 2— Aperture of width 21 bounded by ground plane and covered by pro- 
tective slab of thickness a and dielectric constant er. 
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this two-dimensional situation can be realized experimentally by erect- 
ing parallel conducting planes perpendicular to the electric field. 

Using a plane wave (Fourier) spectrum representation of the fields, 
EH, is given in terms of the transform pair 


Ba,2) = 5- f(t, 2e™ ah (1 
GQh,a) = [Bye de™ de 2) 


where h, the transform variable, denotes the z-directed propagation 
constant of the particular plane wave component. 

The function G(h, x) thus satisfies the one-dimensional wave 
equation 


TOO, 2) + paca, 2) = 0 (3) 


where 
8B = (k’ — h?)? inside the sheath 
B = Bo = (k2 — h’)? outside the sheath, (4) 


k or ko being the respective wave number. The proper Riemann sur-_ 
face, shown in Fig. 3, defines the square root such that 


B=+(h —h)hreal, lal <k (5a) 
= +i(h? — k’)*hreal, |h| > k. (5b) 
Imh h - PLANE 
CONTOUR OF BRANCH CUTS 


INTEGRATION x 
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=e 





g. 3 — Definition A ayePes Riemann surface for evaluation of Bo = (ki — 
nye and B= (k? — h*)” 
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Conditions given by equations (5a) and (5b) are a result of the physical 
requirements that: (7) waves radiated far from the aperature travel in 
the -+-z direction and (77) slow waves propagating in the -:z direction 
decrease exponentially in amplitude with distance from the sheath. 

The boundary conditions are that 


G(h, 0) = Go(h) (6a) 
G, “ continuous at x = a (6b) 


where Go(h) is the Fourier transform of the aperture illumination. 
The principal interest is in the field in region 2, outside the sheath, 
whose transform function is denoted by 


Go(h, 2) = A(h)e*®**. (7a) 


In a similar manner we represent the field in region 1, inside the 
sheath as, 


Gi(h, x) = B(h)je*** + C(hje*” (7b) 
and by direct substitution into (6) we find that 
Go(h)e 8? 





A(h) = B (8) 
cos Ba — 7 B sin Ba 
This leads directly to the expression for EL, 
+00 —iBoa 
CE us | Golhye eiPoretht ah (9) 
2Qr i ° Bo 


” cos Ba — a 


The determination of the far-zone radiation pattern from integral 
representations of the form given in equation 9 generally proceeds 
by saddle point integration. We first transform into the polar coordi- 
nates r, 6 shown in Fig. 2, which are related to x, z by 


x = 7 cos 0 (10) 
z=rsin 6. 
Next, using the transformation h = kp sin a, we find that equation 9 


can be written in the form 


E,(r, 0) = 5 i Fader” cos o da (11) 


where the new contour is that given in Fig. 4. Notice also that in 
transforming from the h- to the a-plane, the integral is independent 
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Fig. 4 — Transformation h = sina showing new contour. 


of the sign of the square root and hence the branch cuts disappear. 
In order to compute the far-field we let kor > oo and use the saddle 
point method to determine an asymptotic formula for the integral. 
For purposes of finding the radiation pattern only, it is sufficient to 
observe that the saddle point occurs at a = 6 and that the angular 
variation of the field can therefore be specified in terms of the (un- 
normalized) function 


G(@) = Go(ko sin @) cos 6- T'(@) (12) 
where 7'(@) is the sheath transmission pattern given by 


-ikoa cos 0 
T(8) = : 


cos [k,a(e, — sin? 6)*] — Loe x sin [k,a(e, — sin’ 6)*] 


(ce, — sin’ @) 
(13 


T(6) is, of course, dependent on @ and can therefore be expected 
to alter, at least to some extent, the radiation pattern of the aperture 
for all angles. However, the most significant changes occur at angles 
near 6 ~ 7/2, and at frequencies in the vicinity of cut off for a TE, 
surface wave on a dielectric slab covering a ground plane.” These waves 
begin to propagate for respective values of k,a given by 


(hea), = AS (14) 


Under the conditions 6 = 7/2 and koa = (koa)n, the denominator 
of equation 13 vanishes, and although this behavior must be weighted 
by the fact that in equation 12, for 6 — 7/2, cos 6 — O, the overall 
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effect is still to produce a broadening of the pattern and therefore 
a resonant component of radiation near the end-fire direction. Typical 
curves for G(6) illustrating this effect are shown in Fig. 5 for values 
of koa centered about (koa),, with Go(kosin 6) = 1, « = 6.0. The 
results therefore pertain to the appropriate Green’s function. With 
regard to interpretation of these curves, it should be noted that, 
strictly speaking, the results cannot be applied at angles arbitrarily 
close to 90°. This is because eventually one reaches the outer boundary 
of the dielectric sheath in which a different solution must be used. The 
curves have been drawn up to 90° under the assumption that in the 
far field the sheath occupies a negligibly small angular sector. 
While the distortion of the radiation pattern has the physical ap- 
pearance of a surface wave, in the sense that its energy is most sig- 
nificant near the dielectric interface, it is actually quite different in 
several respects. First, the radiation discussed here belongs to the 
continuous, rather than to the discrete spectrum of the aperture radia- 
tion field. Its amplitude thus decays inversely with distance from the 
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Fig. 5— G(@) (normalized to unity at @ = 0°) for values of koa near (koa)s, 
Go(ke sind) = 1, er = 60. 
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aperture, rather than being independent of distance, as for an unat- 
tenuated mode guided by the slab-ground combination. Furthermore, 
the end-fire radiation does not exhibit a cutoff characteristic but 
instead has essentially a symmetrical amplitude variation about the 
frequencies given by equation 14. 

The surface wave modes excited by the aperture are, on the other 
hand, associated with the discrete spectrum of the radiation field, and 
are derived mathematically from the residues of the inversion integral 
with respect to its poles. These poles correspond to solutions of the 
equation 


cos Ba + 3 sin ba = 0 (15) 
where a) = (h2 — k2)?, ho being the surface wave propagation constant. 
There is no detailed discussion of surface wave excitation here because 
not enough information is yet available to accurately estimate the 
appropriate excitation coefficient. This observation is made, to some 
extent, a posteriori, from the fact that the “reasonable” assumption 
_ of an unperturbed TZ,>, waveguide mode seems to yield a surface wave 
component considerably higher than we have observed experimentally.* 
For example, theoretical calculations reported in the literature lead 
to the conclusion that the surface wave extracts an much as 50 percent 
of the total radiated power over a wide range of frequencies. If this 
were generally true, the effects would be visible not only as a sub- 
stantial] increase in end-fire radiation but as a corresponding loss in the 
broadside direction. Section III shows that neither of these effects 
was observed in the expected manner. 

A possible explanation for this discrepancy can be found in the fact 
that the surface wave excitation coefficient is proportional to Go(ho), 
the Fourier transform of the aperture field distribution, evaluated for 
the surface wave propagation constant h, . Since we are interested in 
dielectric constants greater than unity, ho > k), and it is therefore 
appropriate to cast Go(ho) in the form (assuming Ey is symmetric with 
respect to 2), 


Bo” ¢@ = D 
m (Ro/ko)”** 


where the C’,, are constants and E{”(z = 1) represents the mth deriva- 
time of the aperture field, evaluated at the edge.” Equation (16) is 
different from the analogous formula for the radiation field, which 
would be a series in direct powers of h/k» in which the coefficients would 


Gols) = OC (16) 
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be weighted integrals of the aperture field. This dependence of surface 
wave excitation on conditions at the edge of the aperture makes the 
problem very difficult to analyze, expecially since measurements have 
shown the field in this region to change significantly when a sheath is 
present.* A physical interpretation would be that surface wave excita- 
tion depends on diffraction rather than direct radiation. 


III. EXPERIMENTAL RESULTS 


The apparatus, illustrated in Fig. 6, consists of a network feeding 
an open X-band (WR-90) waveguide flanged to a ground plane. So 
as to simulate the two-dimensional situation analyzed in the previous 
section, parallel conducting walls in the H-plane were used, as shown 
in Fig. 7. With this configuration, no additional field components are 
induced (at least theoretically), and the fields are essentially inde- 
pendent of position in the E-plane. The ground plane and aperture 
were covered by a uniform slab of stycast material with dielectric 
constant of 6.0 and thickness such that the (TE), mode cut-off fre- 
quency was 10.0 GHz. 

The measurements determined the individual signals received by 
the end-fire and broadside detectors, which consisted of open-ended 
waveguides terminated with broadband matched crystals. These were 
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Fig. 6 — Apparatus for measuring broadside and end-fire radiation. 
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Fig. 7 — Use of parallel conducting planes to simulate two-dimensional geometry. 


compared with the power incident on the aperture by a ratio meter 
having one terminal connected to the input waveguide via a directional 
coupler. At each frequency the tuning stub was adjusted to keep the 
return loss below —30 dB; the results therefore pertain to a matched 
element. 

In order to avoid mutual coupling between waveguide detectors, 
respective measurements of broadside and end-fire signals were made 
with the other detector removed. Since the waveguides were mounted 
in brackets which could be rigidly connected to the conducting planes, 
repeatability was not considered to be a serious problem. 

Figure 8 shows the experimental results, which demonstrate the 
resonant end-fire radiation phenomenon. The peak of the end-fire 
signal occurs at 10.0 GHz, as expected, and the general shape of the 
curve, down to approximately 6 dB below the peak value (normal- 
ized to 0 dB maximum), accurately follows the theoretical behavior. 
The latter was determined by calculating cos 07'(@) for a value of 
6 = 88°. In this regard, it was found by direct calculation that the 
shape of the relative variation with frequency is not sensitive to 
small deviations from 90°. The result given in Fig. 8 should therefore 
be a reasonable estimate for the behavior of the waveguide aperture 
receiver. Notice that 0 dB is defined as the measured end-fire signal 
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at 10 GHz and that the theoretical curve is also normalized at this 
frequency. Thus no attempt has been made to arrive at an absolute 
corroboration, which would require both the solution inside the dielec- 
tric slab and a detailed description of radiation at the end taper. The 
deviations in the end-fire radiation in Fig. 8 below 6 dB probably 
result from these latter aspects. 

Under matched conditions, conservation of energy requires that 
an increase in radiation in any direction must be accompanied by a 
corresponding reduction in other directions. This is clearly evidenced 
by the decrease in broadside radiation in the vicinity of 10.0 GHz. 
The reduction amounts to approximately 3-4 dB, which is quite 
significant, perhaps even more so in terms of antenna performance 
than the increase in the end-fire signal. The displacement of the 
minimal frequency to the slightly lower value of 9.7 GHz is, in all 
likelihood, caused by the generally increasing nature of the broadside 
signal before the dip occurs, attributed to the slow increasing gain 
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Fig. 8— Observed end-fire and broadside radiation under matched conditions. 
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of the receiving aperture with frequency. This would tend to skew 
the curve slightly to the left.* 

In interpreting these experimental results, it is worth noting that 
strong coupling to a surface wave in this experiment would be seen 
in two ways. First, the end-fire radiation would exhibit a cut-off 
characteristic in which the amplitude would be relatively low below 
the cut-off frequency, in this case 10.0 gHz, at which point it would 
increase to a higher value. Second, at approximately the same fre- 
quency the broadside radiation would correspondingly decrease. Since 
the experimental results do not behave in this way, but rather in a 
manner predictable by a far field analysis, it is concluded that strong 
coupling to the surface wave component did not occur for this partic- 
ular set of parameters. 


IV. SUMMARY AND CONCLUSIONS 


The purpose of this study has been to examine, both theoretically 
and experimentally, the radiating characteristics of an aperture-type 
antenna covered by a dielectric sheath. Such an antenna might rep- 
resent, for example, an element of a phased array which has been 
protected against a high temperature environment. 

The first principal effect of the sheath on the pattern of a matched 
element is to introduce a broadening of the radiation pattern and a 
resulting component of radiation in the end-fire direction at fre- 
quencies near the cut-off frequency of a surface wave on the dielectric 
slab. However, it is important to distinguish this phenomenon, which 
is properly viewed as a distortion of the far-zone radiation pattern, 
from what is commonly referred to as a surface wave, which is a 
separate mode of propagation belonging to the discrete spectrum of 
the radiation field. No effects attributable to the latter were observed 
in the experiment, showing that the excitation was negligibly small 
over the measured frequency range. 

Under matched conditions, there must be a corresponding dip in 
the broadside radiation over the same frequency range. This re- 
quirement follows directly from energy conservation. 

It is concluded that a matched phased array element is not suf- 
ficient to guarantee that the antenna is functioning as required. Die- 
lectrics can have a pronounced effect on the directivity and efficiency 


* Recent computational studies by C. P. Wu corroborate this slight frequency 
shift of the dip in broadside radiation. 
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of aperture radiators, especially near frequencies which are charac- 
teristic of surface wave propagation along some part of the structure. 


V. ACKNOWLEDGMENT 


The author would like to thank Mr. P. J. Puglis, who actively 
contributed to all aspects of this work. Conversations with Dr. C. P. 
Wu were also most helpful. 


REFERENCES 


1. Galindo, V., and Wu, C. P., “Dielectric Loaded and Covered Rectangular 
Waveguide Phased Arrays,” BS.TJ., 47, No. 1 (January 1968), pp. 93-116. 
2. Collin, Re E., Field Theory of Guided Waves, New York: McGraw-Hill, 1960, 


p. 470. 

3. Mathews, J., and Walker, R. L., Mathematical Methods of Physics, New York: 
W.A. Benjamin, 1964, p. 78. 

4. Croswell, W. F., Rudduck, R. C., and Hatcher, D. M., “The Admittance of a 
Rectangular Waveguide Radiating into a Dielectric Slab,” IEEE Trans. 
Antennas and Propagation, AP-16 (September 1967), pp. 627-633. 

5. Erdélyi, A., Asymptotic Expansions, New York: Dover Publications, 1965, p. 46. 


A New Reference Frequency Standard 
for the L Multiplex System 


By W. A. KESTER 
(Manuscript received July 17, 1968) 


Four precision quartz crystal oscillators form the heart of a new solid- 
state precision frequency standard, designed to meet the requirements of 
the present and next generation of L multiplex carrier equipment. The 
output frequencies of 64 and 512 kHz are derived from the 4.096-MHz 
oscillator outputs by binary frequency dividers. A master and standby 
divider chain form a redundant path linking two of the oscillators and the 
passive output distribution circuits. Intentional transfers between master 
and standby channels can be made without introducing significant phase 
hits at the output. Automatic transfer between channels occurs upon a 
catastrophic loss of the output signal. Two additional oscillators and 
dividers are provided as “hot spares’? which can be manually patched 
into service. Critical points in the system are monitored by major and 
minor alarm circuits, and a very low frequency recetver-comparator provides 
a means for maintaining the oscillator frequencies to within one part in 
10° of a vlf standard frequency broadcast. 


I. INTRODUCTION 


The L multiplex system is a single-sideband suppressed carrier 
system in which the carrier frequency supplies in the various offices 
are synchronized by the transmission of pilot frequencies over a tree- 
like network. At each office the carrier frequencies necessary to per- 
form the modulation and demodulation steps are derived from the 
outputs of an “office master” supply which is phase-locked to the 
incoming synchronizing pilot tone. This “office master” supply, named 
the primary frequency supply in L multiplex terminology, has been 
described in previous literature.* 

The tree-like synchronization network originates at a single pri- 
mary frequency supply called the “system master.” While relative 
frequency accuracy between offices depends on the phase-locked 
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synchronization network, the absolute frequency accuracy of the 
pilot tones is dependent upon the absolute frequency accuracy of the 
system master. Although the free-running accuracy of a primary fre- 
quency supply is such that fairly appreciable periods of pilot outage 
can be tolerated, a highly accurate reference frequency standard 
is still required to ultimately control the system master. 

In the past this frequency standardization service has been pro- 
vided by Bell Telephone Laboratories, Murray Hill, New Jersey, 
where the Bell System Primary Standard of Frequency has been 
maintained to an absolute frequency accuracy of one part in 10° by 
periodic corrections. In recent years the need has arisen to replace 
the Murray Hill frequency standard with a more reliable and rugged 
solid-state version which would be located and maintained in a 
hardened telephone office. The new Bell System Reference Frequency 
Standard described in this article has been designed to meet these 
immediate requirements of the present L multiplex system, including 
recent requirements of the new L-4 coaxial system. In addition, sys- 
tem objectives and requirements for the new standard have been 
selected in order to anticipate the needs of at least the next genera- 
tion of multiplex equipment after L-4. 


II. SYSTEM OBJECTIVES AND REQUIREMENTS 


2.1 Frequency Accuracy 

The absolute frequency accuracy requirement for the reference 
frequency standard has been established as one part in 10°. This 
requirement meets the needs of the present L multiplex system (in- 
cluding L-4) and should be adequate for any future multiplex systems 
which may be developed over the useful life of the new standard. 

A survey of recent developments in the field of frequency standards 
and frequency measurement techniques reveals that the above re- 
quirement can be met economically and reliably by the use of a 
solid-state double-oven quartz crystal oscillator as the frequency 
source and the very low frequency broadcasts of the National Bureau 
of Standards as a comparison for periodic corrections of the oscillator 
frequency. The technique of measuring frequency offsets using a vlf 
receiver-comparator is relatively simple, and accuracies of a few 
parts in 10*° can be obtained in a measuring interval of several hours 
(see Section IV)... 

The accumulated oscillator frequency drift during the time between 
corrections (maintenance interval) is equal to the oscillator drift 
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rate multiplied by the maintenance interval. This implies that for a 
minimum maintenance interval of one month, the oscillator drift 
rate must be less than approximately 5 parts in 10'° per week in order 
to maintain the accuracy requirement of one part in 10°. This as- 
sumes that at the time of maintenance the oscillator is corrected to 
the lower limit if the drift rate is positive and to the upper limit if 
the drift rate is negative. 


2.2 System Reliability 

The overall reliability objective for the new standard is that it be 
able to provide continuous and accurate output signals to the L 
multiplex system. Although short periods of outage can be tolerated, 
the new standard has been designed to minimize their probability 
and duration. These reliability objectives have been met in the fol- 
lowing manner: 


(z) Solid-state circuits are used wherever possible. 

(1) Redundant circuits have been provided which are automatically 
switched into service in the event of catastrophic failures. Other 
redundant circuits can be manually patched into service in case of 
double failures or failures in the automatic switching circuits them- 
selves. 

(wr) Special circuits have been designed which allow routine main- 
tenance checks of the redundant switching equipment to be per- 
formed without causing either interruption of service or phase per- 
turbations in the L multiplex synchronization network. 

(wv) Critical areas in the system are monitored by major and 
minor alarm circuits. Visual and audible indications of trouble con- 
ditions are provided. 

(v) The equipment comprising the system has been designed to 
facilitate ease of replacement and repair. 


23 L Multiplex Interface 


The interface betweeen the reference frequency standard and 
the L multiplex synchronization network is the primary frequency 
supply which has been designated ‘“‘system master.”’ The requirements 
of the primary frequency supply thus determine the following re- 
quirements of the new standard: 


(t) Output frequencies should be 64 and 512 kHz to meet the 
requirements of two existing versions of the primary frequency supply. 
(i) The 64- and 512-kHz output signals should be square waves 
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at an approximate level of —23 dBm (fundamental component) and 
an impedance of 135 ohms, balanced. 

(22) Phase hits introduced at the output because of intentional 
transfers between master and standby channels should be less than 
20 nanoseconds. This requirement allows maintenance transfers to 
be performed without significantly perturbing the phase-lock circuits 
in the primary frequency supplies. 


III. SYSTEM DESCRIPTION 


3.1 General 


Figure 1 is a block diagram of the circuits comprising the primary 
signal paths in the reference frequency standard. The output frequency 
of the precision oscillators is 4.096 MHz, hence binary frequency 
dividers are used in deriving the output signals of 512 and 64 kHz. 
There are four precision oscillators (1, 2,3, and 4) and four frequency 
dividing channels (A, B, C, and D). A and B are master and standby 
channels in normal system operation. The outputs of these channels 
pass through gates which allow either channel A or channel B (but 
not both) to drive conjugate ports on the 64- and 512-kHz combiner 
hybrids. The output ports of the hybrids drive resistive distribution 
buses, and the conjugate ports drive the major alarm transfer circuit 
which detects a loss of signal at the hybrid output. The gates are under 
the control of the automatic switch circuit which initiates a channel 
transfer upon receipt of a command from the major alarm transfer 
circuit. Provisions are also made for manually reversing the state of 
the gates. 

The phase align and channel transfer circuit in conjunction with 
the capacitive phase shifters and the magnetic clutch assembly allow 
intentional manual transfers to be performed which introduce phase 
hits of less than 20 nanoseconds (less than 4° at 512 kHz) at the 
output distribution buses. This type of “hitless” transfer is initiated 
by a pushbutton switch on the front panel which enables the phase 
align and channel transfer circuit and the magnetic clutch assembly. 
The shaft of the phase shifter in the off-line channel is then con- 
nected to the shaft of a knob on the front panel by energizing the 
appropriate magnetic clutch. The knob is rotated, and when phase 
coherence (less than 20 nanoseconds phase difference) of the 4.096- 
MHz square waves in the two frequency dividers is detected, the 
counters in the off-line channel frequency divider are reset to zero 
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Fig. 1— Block diagram of principal circuits in reference frequency standard. 
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upon detection of the all-zero code in the on-line divider. At this 
instant a command pulse changes the state of the channel gates, 
thereby effecting the “hitless” transfer. Upon completion of the above 
sequence, the magnetic clutch is de-energized, and the phase align 
and channel transfer circuit is returned to its normal state. 

For additional redundancy, two “hot spare” channels have been 
included, either of which can be manually patched into the combiner 
hybrids should a failure occur in both channel A and B or the 
common control circuits. These hot spare channels, C and D, are 
complete with oscillator, divider, and driver circuits. 

Manual patching allows any of the four oscillators to be associated 
with any of the four channels. Under normal conditions, the two 
most stable oscillators (determined by vlf measurements) would 
be patched into channels A and B, and the outputs of channels A 
and B would be connected to the combiner hybrids. The flexibility 
provided by the manual patches at both the oscillator outputs and 
the hybrid inputs allows frequency service to be restored under a 
wide variety of failure conditions. 

In addition to the primary circuit functions there are a number 
of secondary functional blocks which are described in greater depth 
in sections to follow. They are: power supply circuits (Section 3.5), 
major alarm circuits (3.6), minor alarm circuits (3.7), vlf receiver- 
comparator circuits (3.8), and digital time-of-day clock (3.9). 


3.2 Precision Oscillators 

The solid-state precision oscillators used in the reference frequency 
standard are the double-oven quartz-crystal type.” * Figure 2 is a block 
diagram of the oscillator circuit. The crystal is an AT-cut polished 
plano-convex quartz plate designed to operate on its fifth mechanical 
overtone in the thickness-shear mode. The crystal is mounted in an 
evacuated enclosure and is housed in an inner temperature-controlled 
oven. A vernier frequency adjustment is provided which allows a 
variation of several parts in 10’ about a nominal value of 4.096 MHz 
by changing the bias on a varactor diode. The circuits comprising 
the oscillator (including the inner oven assembly) are surrounded 
by an outer temperature-controlled oven for greater temperature 
stability. 

The drift rate for the precision oscillators is a few parts in 10*° 
per week. Figure 3 shows a plot of frequency offset as a function of 
time over a period of several months for one of the oscillators. Drift 
rates such as these can only be realized after an initial warm-up 
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Fig. 2— Precision oscillator block diagram. 


period of approximately one month, during which the frequency grad- 
ually stabilizes. Any disruption of power after this time requires 
another period of stabilization before low drift rates can again be 
realized. For this reason considerable redundancy has been designed 
into the oscillator power supply arrangement (see Section 3.5). 


3.3 Binary Frequency Dividers 

The frequency dividers used in the reference frequency standard 
are six-stage binary counters with output taps at 512 and 64 kHz 
(see Fig. 4). The input to the counter chain is a 4.096-MHz square 
wave derived from the oscillator output sine wave signal. The input 
to the 512-kHz output flip-flop is the result of gating the 4096-, 2048-, 
and 1024-kHz signals. This technique reduces jitter and eliminates 
the cumulative delay uncertainty of several counter stages. A similar 
gating arrangement is provided at the input to the 64-kHz flip-flop. 
High speed switching transistors with storage times of less than 5 
nanoseconds are used in the counter circuits to further reduce delay 
uncertainty. These techniques are required to insure accurate opera- 
tion of the phase align and channel transfer circuit and to reduce 
jitter in the output signals. . 

Figure 5 is a schematic diagram of a single stage in the counter 
chain. The “toggle” input is triggered by negative-going pulses, and 
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Fig. 3— Typical precision oscillator drift characteristics. 


a positive pulse at the “reset” input forces the output to a logic zero, 
corresponding to the saturation voltage of the transistor. 


3.4 Phase Align and Channel Transfer Circuats 


The purpose of the phase align and channel transfer circuits is to 
allow the initiation of an intentional transfer between channels A 
and B which introduces a phase hit of less than 20 nanoseconds at 
the output buses. The basic transfer sequence is described in Section 
3.1. Fundamentally, the circuits must detect the phase coincidence 
of the 4.096-MHz square waves in both channels and the all-zero 
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code in the on-line channel in order to generate properly timed reset 
pulses for the off-line divider and channel transfer commands for the 
automatic switch circuit. Figure 6 is a block diagram of the detection 
and pulse generating circuit. 

The shapers (see Fig. 7) generate narrow pulses of less than 10 








GATE 
ENABLE RESETS TO 
DIVIDER B 
64 kHz SHAPER 
ra A Uaaeeiay 
MONO- | CHANNEL 
FROM COMMAND 
DIVIDER pomipins 
A 
4.096 MHz 
SHAPER 
gape A 
/ 
hone 
ae 
MH \ 
4.096 Zz 
SHAPER TRANSFER TO 
oe MONO- MONO- | CHANNEL A 
FROM AND] | STABLE STABLE | COMMAND 
DIVIDER GATE MULT |= MULTI- 
VIBRATOR 
B VIBRATOR 
ehhh 100 NSEC 
SHAPER 
A RESETS TO 
GATE DIVIDER A 
ENABLE 


Fig. 6 — Phase align and channel transfer, phase coincidence circuit. 


2192 THE BELL SYSTEM TECHNICAL JOURNAL, DECEMBER 1968 





+12 
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nanoseconds basewidth from the negative-going portions of the input 
square waves. When the phase align and transfer circuit is activated, 
the appropriate diode AND gate is opened and the output of the gate 
is a properly timed trigger pulse which occurs only when the 4.096- 
MHz signals are in phase and at the precise instant of the all-zero 
code in the on-line channel frequency divider. A 100 nanosecond 
monostable multivibrator, which is triggered by the pulse, generates 
reset pulses for the off-line divider before the next 4.096-MHz cycle 
and inhibits the diode anp gate. A second monstable multivibrator 
which is triggered by the reset pulse generates the transfer signals 
for the channel gates. A special mode is provided for tests which 
allows the off-line counters to be reset but inhibits the output of the 
second monostable multivibrator, thus preventing an actual channel 
transfer. 


3.5 Power Supply Circuits 

In the frequency standard, dc-to-de converters have been used 
to isolate critical circuits from noise and voltage fluctuations asso- 
ciated with the central office battery. A total of eight converters are 
used in the system. Four converters supply power to the oscillators 
and four to the digital circuits. The modular plug-in construction 
of the converters facilitates replacement. 

A redundancy scheme is followed in the fusing and distribution of 
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the regulated voltages so that no single converter failure will cause 
a catastrophic failure at the output of the distribution buses. Figure 
8 illustrates the redundant power feed arrangement for the precision 
oscillators. Office battery voltage is brought to the frequency standard 
cabinet over two independent fused paths from the battery bus. Two 
high-power silicon diodes form an or gate which allows either path to 
fail (either open or short) without disrupting power to the de-to-de 
converters. The outputs of each pair of converters are also passed 
through or gates so that any one of the four converters may fail 
without disrupting power to any oscillator. The fusing arrangement 
insures that no single failure in either a diode, a converter, or a 
fuse to the left of the dotted line in Fig. 8 disrupts power to any 
oscillator. This fusing scheme also allows power to be temporarily 
removed from faulty diodes or converters, thereby allowing repairs 
to be made without disrupting power to the parts of the system not 
directly affected by the failure. A fuse alarm contact is provided 
on each fuse. This contact is connected to the fuse input if the fuse 
should blow, thereby notifying the appropriate minor alarm circuits. 


3.6 Major Alarm Transfer Circuits 

The major alarm transfer circuits monitor the output signal levels 
and generate a central office major alarm should the levels fall out- 
side preset limits. Figure 9 is a block diagram of the circuit. The 
alarm circuits are driven from the ports of the 64- and 512-kHz hybrids 
which are conjugate to the ports driving the distribution buses. Two 
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monitoring channels are driven from each hybrid. One channel drives 
a meter-relay with adjustable upper and lower limits. The bandpass 
filter in series with the meter allows this channel to detect gross fre- 
quency shifts as well as level variations. The second channel contains 
a circuit which detects a catastrophic loss of level. A central office 
major alarm is generated if either or both the meter indications drift 
outside the present limits of a few dB above or below the nominal level 
of —23 dBm. A command which initiates an actual channel transfer 
is generated only if both monitoring channels detect a fault. This 
logic insures that a single failure in either monitoring channel does 
not generate a needless transfer and a possible phase hit at the output. 

An input from the minor alarm circuits prevents an automatic 
transfer if a minor alarm condition exists in the off-line channel. 
Test switches are provided which allow a failure to be simulated, 
thereby verifying the proper operation of the major alarm circuits 
and the redundancy switch. This test can be performed without intro- 
ducing a phase hit at the output by first using the phase align and 
channel transfer procedure to bring the two signals into phase coin- 
cidence. 


3.7 Minor Alarm Circuits 

The minor alarm circuits monitor critical points associated with 
the oscillators, frequency dividers, and power supplies. In case of a 
failure in any of the monitored circuits, the central office alarm is 
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activated and lamps on the front of the cabinet allow the trouble 
to be quickly isolated. whe . : 

In order to insure a prompt indication of any significant shift in 
the output frequency of the oscillators driving channels A and B, 
the circuit shown in Fig. 10 constantly monitors the frequency offset 
between channels A and B. A similar circuit monitors the frequency 
offset between channels A and C. Two meter-relays with adjustable 
upper limits provide the alarm initiation. The meters are calibrated 
to read 12 parts in 10° full scale. Assuming that the frequency of only 
one of the three channels drifts out of limits, simple relay logic 
determines which of the three channels contains the fault. The moni- 
toring circuit (Fig. 10) uses a balanced modulator followed by a 
low-pass filter to generate the difference frequency. Zero-crossings 
of the difference frequency are counted by a six-stage counter for 
a 10-minute interval, and the count is read into a register. The binary 
output of the register is translated by a digital-to-analog converter 
into a signal which is proportional to the number of zero-crossings 
of the difference frequency in a 10-minute interval. Each count rep- 
resents a phase shift of 180° between the two 4.096-MHz signals, or 
122 nanoseconds. This corresponds to a fractional offset of 2.04 parts 
in 10° per count. With a six-stage counter, 63 discrete nonzero levels 
exist, hence a full count of 68 represents 12.9 parts in 10° frequency 
offset between channels. The signals which control the gates and reset 
the counters at 10-minute intervals are derived from the binary 
coded decimal time code outputs of the digital time-of-day clock 
by a decoder circuit. 
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Fig. 10 — Difference frequency detector. 
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Several other points in the system are monitored by various minor 
alarm circuits. They are: 


(z) Inner oven heater voltage for each of the four oscillators (devia- 
tions about a nominal level of more than one volt generate a minor 
alarm). 

(iz) Output of each frequency divider (loss of output of any off- 
line divider generates a minor alarm). 

(12) Oscillator power supplies (deviations of more than one volt 
generate a minor alarm). 

(zv) Digital power supplies (loss of any supply generates a minor 
alarm). 

(v) Fuses (any blown fuse generates a minor alarm). 


3.8 VLF Recewver-Comparator Circutts 


The vlf receiver-comparator circuits allow accurate frequency com- 
parisons to be made between any of the four channels and any receiv- 
able vif station. Frequency comparisons between any two channels 
are also possible. This feature allows frequency standardization to 
continue using any of the four oscillators as a local standard should 
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normal vif receptions be impaired for significant periods of time, 
such as during a national disaster. 

Figure 11 is a block diagram of the vlf receiver-comparator circuit. 
The selector switches on the front panel (see Fig. 17) allow any one 
of the ten possible frequency comparison modes to be selected. Inter- 
comparisons between channels A and B, for example, are made by 
connecting the 64-kHz signal from channel A to the reference fre- 
quency input of the receiver and a low-level 16-kHz signal derived 
from channel B to the antenna input of the receiver. This frequency: 
division to 16 kHz is necessary since neither 64 kHz or 32 kHz are 
valid received frequencies for the vlf receiver-comparator. 

The vif receiver-comparator accepts a 64-kHz reference signal 
input. The receiver then synthesizes a 1.1-kHz signal from this 64-kHz 
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Fig. 12 — (a) VLF strip-chart recording and (b) frequency offset template. 
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input and phase corrects it to a 1.1-kHz signal derived from the 
antenna input. The accumulated phase difference between the two 
1.1-kHz signals is plotted on a strip-chart recorder, from which ac- 
curate frequency offsets can be easily computed (see Section IV). 


3.9 Time-of-Day Clock 

The digital time-of-day clock is driven from the alarm port of the 
64-kHz combiner hybrid (see Fig. 1). Days, hours, minutes, and sec- 
onds are displayed on the panel by indicator tubes, and controls are 
provided which allow the clock readout to be synchronized with a 
suitable time-standard broadcast. The digital clock also generates a 
time code (binary coded decimal) which is used by the difference 
frequency detecting circuits (see Section 3.7) in deriving the periodic 
control pulses. 


IV. FREQUENCY STANDARDIZATION 


The technique of accurate frequency offset measurements using a 
vlf receiver-comparator is relatively simple and has attained wide- 
spread use in recent years.®° Because of the effects of propagation 
anomolies during sunrise, sunset, and darkness, vlf measurements 
usually are made only when both the vlf transmitter and receiver 
are in daylight. A typical strip chart recording is shown in Fig. 12a. 
The plot shows the phase difference between the 1.1-kHz signal derived 
from the 64-kHz source and the 1.1-kHz signal derived from the vlf 
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Fig. 13 — Maintenance of oscillator frequency accuracy by periodic corrections. 
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Fig. 14— Front view of reference frequency standard cabinet. 
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Fig. 15— Rear view of reference frequency standard cabinet. 


signal as a function of time. Normally, station WWVB (60 kHz) 
will be selected as the reference station.* 


The phase offsets which occur every hour are introduced by WWVB 
to serve as identification. The fractional frequency offset is simply 


* Station WWVB, Fort Collins, Colorado is operated by the National Bureau of 
Standards Time and Frequency Division at Boulder, Colorado. The 60-kHz trans- 
mitted signal is based on the atomic second which is defined in terms of a speci- 
fied transition between electron energy levels of Cesium-133. 
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the ratio of the accumulated phase shift to the time interval over 
which the phase shift occurs. For the example shown in Fig. 12a this 
ratio is approximately 2 parts in 10*°, the sign of the slope indicating 
that the oscillator frequency is low with respect to WWVB. Meas- 
urements of this type can be quickly made with the aid of a special 
transparent template which is calibrated directly in fractional parts 
in 10*° as shown in Fig. 12b. 

The procedure for maintaining an oscillator to within one part 
in 10° of WWVB transmissions is as follows. A series of weekly fre- 
quency offset measurements are made in order to establish the oscil- 
lator’s approximate drift rate. The maximum allowable maintenance 
interval is then determined based on the drift rate and the accuracy 
requirement of one part in 10°. A plot of frequency offset versus time 
for an oscillator being maintained to the above accuracy is shown 
in Fig. 18. The periodic correction of 2 parts in 10° is made using 
the calibrated vernier frequency adjustment on the oscillator. 


V. EQUIPMENT DESIGN 


The equipment comprising the reference frequency standard is 
housed in a special seven-foot high steel cabinet which is designed 





Fig. 16— Drawer I. 
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Fig. 17 — Drawer II. 


to mount standard 19-inch panels or drawers. Front access is through 
a locking safety-glass door (Fig. 14) and rear access through a lock- 
ing steel door (Fig. 15). In the hardened underground site the cabinet 
is suspended from shock mounts. All input and output cabling is 
through holes in the top of the cabinet. The vlf antenna is mounted 
outside the entrance to the site. 

The overall front cabinet panel arrangement from top to bottom 
is as follows: digital clock (mounted on slides) ; meter panels indicat- 
ing frequency offset, output levels, and inner oven heater voltages; 
drawer I containing frequency dividers, phase align and channel 
transfer circuits, and certain alarm circuits (mounted on slides); 
drawer II containing frequency comparison selector switches and 
logic, difference frequency circuits, and alarm circuits (mounted on 
slides) ; vlf receiver-comparator (mounted on slides); writing shelf; 
and oscillator panel which swings open for access to precision oscilla- 
tors. 

The front panels have been designed so that most normal mainte- 
nance functions can be performed from the front. The two sliding 
drawers give easy access to the critical digital circuits. Connectors 
at the rear of each drawer allow the entire drawer to be easily 
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removed from the cabinet should extensive maintenance be required. 

Drawers I and II are shown in Figs. 16 and 17, respectively. The 
meters on the front panel of drawer I display the phase difference 
between the 4.096-MHz signals in dividers A and B and between 
the 64-kHz signals in dividers A and B. The pushbutton initiates the 
operation of the phase align and channel transfer circuits, and the 
knob allows the off-line phase shifter to be rotated. The meters indi- 
cate proper operation of the circuit when they read zero after the 
channel transfer has occurred. 

The front panel of drawer II contains the frequency comparison 
selector switches which allow any possible frequency comparison to 
be made by the vlf receiver. On-line indicator lamps and various 
major and minor alarm lamps are also on this panel as well as the 
alarm cutoff pushbutton switch which silences the office alarm. 

A major portion of the circuits in the frequency standard are 
mounted on printed wiring plug-in cards for ease and speed in re- 
placement of defective units. The cards on which high-speed logic 
is performed have a solder-plated ground plane on the component 
side as shown in Fig. 18. This ground plane is returned to chassis 
ground via the connector. In this manner a good sink is provided 
for the large transient ground currents which are generated by the 
high-speed switching circuits. Other low-speed circuits are mounted 
on a more standard card of the type shown in Fig. 19. 

The units which can be reached from the rear of the cabinet (see 








Fig. 18 — High speed plug-in card: (a) component side, (b) printed wiring side. 
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Fig. 19— Standard plug-in card: (a) component side, (b) printed wiring side. 


Fig. 15) are (from top to bottom): fuse panel; digital power supply 
panel; rear panels of Drawers I and II; rear panel of vlf receiver; 
oscillator patching panel; rear of oscillators; and oscillator power 
supply panel. 


VI. CONCLUSIONS 


The new Bell System Reference Frequency Standard has been 
designed to meet the frequency accuracy, reliability, and maintaina- 
bility requirements of the present L multiplex system as well as the 
next generation of carrier equipment. The system has been installed 
and tested in the hardened underground telephone office, and a cutover 
is planned in the near future. 
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Analysis and Synthesis of a Digital 
Phase-Locked Loop for FM 


Demodulation 


By G. PASTERNACK and R. L. WHALIN 
(Manuscript received July 12, 1968) 


A method of synthesizing a general nth order phase-locked loop 1s pre- 
sented. In contrast to conventional phase-locked loops, the circuitry is 
digital rather than analog. The general circuit consists of an assembly of 
logic blocks (gates and storage elements) which, when driven by external 
clock signals, exhibits phase-locked loop properties. These properties, along 
with high stability and the absence of adjustments, make the digital phase- 
locked loop ideally suited for use in large systems which use monolithic 
integrated circuits for microminiaturization. Analysis and synthesis tech- 
niques make use of Z-transform methods in achieving the desired frequency 
response as the realization of an nth order difference equation. A general 
technique ts developed and two specific cases, n = 1 and n = 2, are con- 
sidered in detail. Analytic results relating to the phase-locked loop’s static 
and dynamic performance are derived and found to corr elate well with 
laboratory results for actual circuits. 


I. INTRODUCTION 


A new phase-locked loop (PLL) with interesting properties has 
been developed for potential application in large multiple data set 
installations which provide low speed serial data communications 
for time-shared computers. An objective for such data set arrange- 
ments is to minimize cost per channel by putting the major part of 
the required circuitry into a common section where it may be shared 
by all channels. This objective is achieved by using a digital PLL 
as an FM demodulator with low cost logic circuits located in the 
channel units and clocks with their associated driving amplifiers 
located in the common circuits. PLL’s which use analog circuits have 
received considerable attention and analysis and synthesis methods 
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are available: However, the circuits covered here are digital, and 
the approach is similar to that of digital (or sampled-data) filters.* ¢ 
By using a digital PLL, no low-pass filter or voltage-controlled oscil- 
lator, generally associated with the feedback loop of conventional 
PLL’s is required.* This property, along with high stability and the 
absence of adjustments makes the digital PLL ideal for microminia- 
turization using monolithic integrated circuits. 

This paper presents snythesis procedures for an mth order digital 
PLL. The PLL realized by such a procedure possesses a response 
which obeys a linear nth order difference equation. Analysis is per- 
formed using Z transform methods commonly encountered in sampled- 
data control systems.® The technique is that of establishing a mapping 
between the s plane and the z plane so that a correspondence between 
the coefficients of the controlling nth order difference equation and 
the desired s plane poles may be established. An iterative circuit 
is presented so that once the coefficients are determined, the nth 
order loop may be realized. 

The remainder of the paper is devoted to the synthesis, realization, 
and analysis of two specific examples, n = 1 and n = 2. Both systems 
are analyzed to determine static and dynamic performance, and the 
results of data transmission tests are given. The out-of-lock behavior 
as well as internal noise resulting from jitter is characterized for the 
first order PLL. The second order PLL is representative of higher 
order systems and the analysis is easily extended. The “capture 
phenomenon” associated with underdamped systems is encountered. 
In both of the examples considered, experimental results are found 
to correlate well with theory. . 


II. THE DIGITAL PHASE-LOCKED LOOP 


As a prelude to the synthesis procedure we show that the loop 
response of the PLL can be expressed as an nth order difference 
equation. Figure 1 is a block diagram of the loop. Among its basic 
components are an exclusive oR comparator which develops an output 
gating function dependent upon the phase relation of its inputs, and 
a transmission gate acting on several clock signals f;, g; to provide 
inputs to register circuitry. When the loop is locked, a shift circuit 
periodically transfers the contents of the (2 — 1)th register to the 
ith register; the period is one half that of the input signal. The shift 


* Although no internal loop filter is needed, a low-pass filter is required to 
recover the demodulated baseband signal. 
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TABLE I—SyMBOL 


Real part of z-plane pole, 2;. 

Imaginary part of z-plane pole, 2;. 

Parameters pertaining to stability analysis of second order PLL. 

Normalized total input frequency deviation. 

Total input frequency deviation. 

Lock range of PLL. 

Voltage step. 

Peak-to-peak noise voltage. 

Peak-to-peak signal voltage. 

Time errors resulting from phase discontinuity when switching 
between clocks. 

Real and imaginary parts of Butterworth characteristic with carrier 
frequency input. 

eanen of kth positive (logical ‘1’’) level of feedback (flip-flop) 
signal. 

Interval between kth and (k + 1)th zero crossing of input signal. 

Real part of s-plane pole, s;. 

Duration of kth positive (logical ‘‘1”’) level at exclusive- or output. 

Radial cutoff frequency of low-pass filter. 

Imaginary part of s-plane pole, s;. 

Counting capacity of N-stage counter. 

Input voltage. 

Feedback voltage. 

PLL output voltage. 

Frequency of input signal. 

Discrete input frequency. 

Cutoff frequency of low-pass filter. 

Average input (carrier) frequency. 

Rest frequency of PLL. 

End point of frequency lock range. ; 

Clock frequencies (7 = 1, 2, --- , n); also denotes input to registers. 

Normalized clock frequencies (¢ = 1, 2, -+- , ). 

eee function (Butterworth). 

Counting indices. 

Threshold of final register. 

Order of system; number of registers. 

Number of counter stages per register. 

Minimum N for stable operation. 

ih s-plane pole. 


ime. 

Period of baseband data signal. 

Signal durations for stability analysis. 
Average output voltage during kth interval 
Continuous time function. 

Average output voltage for parasitic mode. 
Argument of z-transform. 

Z-transform of v(k). 


(and reset of the first register) is controlled by the nth register which 
provides an output pulse after the Mth clock pulse is counted. A flip- 
flop converts the output pulse train to a square wave and provides 
an input to the comparator. 

The difference equation describing operation of this circuit is de- 
veloped with the aid of waveforms shown in Fig. 2, which gives the 
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Fig. 1 — Block diagram of general digital PLL. 


steady-state terminal signals for the comparator. The output signal 
is the gating function. Assume that clocks gi, gz, 93, .- +3 Yn are 
enabled during the “0” level, and all other clocks f;, fe,..., fn are en- 
abled during the “1” level. With the first register initially cleared, its 
count at the conclusion of the (k + 1)th period of gating is gi[p(k) — 
r(k)] + fir(k + 1). During each successive period, this count is 
shifted into the ith register and augmented by a count of gi[p(k + 
t—1) —c(k +%7—1)] + fir(k +7) fort = 2,8,...,. The count 
propagates through the n registers where, upon reaching the number 
M at the nth register, the count is reinitiated. Although n periods are 
required for a complete count cycle, the process may be thought of as 
the interleaving of cycles initiated p seconds apart. 
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III. GENERAL FORM OF THE DIFFERENCE EQUATION 


It can be seen from Fig. 2 that the count of the nth register is the 
sum of pulses counted during the intervals p(k) — r(k), p(k + 1) — 
r(k + 1),...,p(K+n-—1) —7r(k +n —1),7(k + 1), 7(K + 2), 

., t(kK + n). Summing the counts for the sources fi, fo,..., Sn; 
91) 92,+++)9n and equating the sum to M gives 

gile(k) — rh)} + glok +1) —-7rk+)D)+-:-: 

+ glee +n — 1) — rk +n — 1)] 
thrk+D+---+f7rh+n) = M, (a) 


or rewriting, 


far(k + 1) + aa = ga)r(k bm = 1) toe = gur(h) 
= M = [glk +2 — 1) + gaaplk n= 2) + 
+ gop(k + 1) + gip(k)]. (1b) 
INPUT SIGNAL p= a 


ie — 
~~ p(K)->< p(K+1)>4<= pl(kt2)>] ++ + ETC 
o--— 


FEEDBACK SIGNAL 


j-----— _— 


Ef --&(k)-->be &(K41) E(K+2)>| 6 ++ ETC 


EXCLUSIVE -OR OUTPUT (GATING SIGNAL ) 
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Eo 
o-—— —_ 
Sani Sabine ke af +++ etc 
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Fig. 2— Input, feedback, and output waveforms for PLL which is assumed to 
be locked with constant input frequency. Gated clock signals are shown 
symbolically. 
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This is the system difference equation relating the response 7(k +.1) 
to an excitation p(k + 7). In its general form this equation, when 
Z-transformed, is analogous to a Laplace transformed system equation 
in which polynomials in s multiplying the response and excitation 
functions result in poles and zeros, respectively. Since there is a 
one-to-one correspondence between the s and z planes with regard to 
poles and zeros, it is expected that by properly choosing coefficients, 
equation 1b may be synthesized to provide a desired frequency re- 
sponse (high-pass, low-pass, bandpass, and the like) possessing spe- 
cified critical frequencies. As we show, it is the objective of this paper 
to exploit the low-pass properties of equation 1b. To achieve maximum 


high frequency attenuation, the coefficients g;,7 = 2,3,..., n will be 
set equal to zero thereby locating all z-plane zeros at infinity. 

fait(k + n) + faitv(k +n — 1) +--+ — gr(k) = M — gio(k). (1c) 
Equation le is normalized by letting F; = f;/fa,j = 1,...,n —1 


and G, = g:/f,. Also, notice that the cycle-by-cycle average voltage 
of the r(k + 7) interval expressed as a fraction of the maximum 
possible voltage is given by 


nk +) = EES. 


Incorporating these substitutions gives 


vk +n) + F,wk +n —-1) +++ + Fwk + 1) — Gv(k) 
= 2MF(k)—-G, (8) 


(2) 


where 


on See 
2fno(k) 

If it is assumed that p(k) changes very little with k and that p(k) 
is small with respect to the system response time (which it is), the 
v(k) can be represented as a sampled continuous function of time, 
u(t), letting ¢ = kp. That is, v(t) is a function whose value at the 
kth zero crossing of the input signal is v(k). 

This type of equation is best solved using z transform methods 
assuming that the input frequency is constant. In particular, the 
following transform pairs are noted: 


Fk) = 


co 


Ziv(t)] = Zo(k)] = VE) = Dd v(jp)e’. (4a) 


7=0 
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Z[A exp (ak)] = to (4b) 
Zio(k + = 21VO) — Dev - d. (40) 


Accordingly, equation 3 is transformed as follows: 
z"[Vi2)] — do z’v(n — J 
t=1 


4a. SV Oa un PD aos 





+ FaV® — 0] — GV@ = MF - ay; = | (5) 


Combining terms, 


[e+ Fie + eee + Riz — GVO 


= (2MF — ay(. = -) + yD » FevG-), © 





where 


Fy, = 1. 
IV. GENERAL SYNTHESIS PROCEDURE 


Before developing a synthesis technique for the digital PLL, it is of 
interest to review a specific application, that of FM demodulation. A 
“lock range’? may be defined for the PLL whereby steady state input 
signals having constant frequencies lying within this range will cause a 
steady-state output, v(k + 9) = v(k + 7+ 1), allj, such that this output 
is linearly related to the input frequency f = 1/2. This is the relation 
required for demodulation. 

For dynamic behavior one may consider a binary baseband signal in 
which each of the two states is assigned a discrete frequency f., f, within 
the lock range. If the baseband signal switches randomly between states 
with a maximum rate 1/7 a new phenomenon is introduced. In this 
context the PLL may be regarded as a low-pass filter which should 
possess a bandwidth, w,, equal to or greater than 7/7. By proper choice 
of the coefficients in equation 6 the desired filter shaping may be a- 
chieved. It is expected that the low pass filter characteristics will be a 
function of the input frequencies f,, f,. The resulting complication is 
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conveniently eliminated by making use of the narrowband approxima- 
tion fa, fs © fm where f,, = 3(f. + fr). This is not unrealistic, because the 
PLL was developed for just such a narrowband system.* With this in 
mind, we now give a synthesis procedure. In using Z transform techni- 
ques it is assumed that the input frequency is approximately constant 
so that samples are taken at equal time intervals. 

The coefficient of V(z) in equation 6 is the “characteristic poly- 
nomial” of the system and using it, the desired low pass filtering 
properties of the loop can be synthesized. For example, assume that 
a transfer function with poles in the s plane at 81, s2,... , S, 1s to be 
synthesized. Its characteristic polynominal is given by 


n 


IG — 5). (7) 


7=1 
A conformal mapping between the s plane and z plane is given by 
the transformation 


z = exp(ps), (8) 
by which the pole s; = o; + tw; is mapped into 
z; = exp [(p) (o; + %a;)] 
= [exp (pc;)] [cos (w;p) + isin (w;9)]. (9) 
Since the complex s plane poles occur in conjugate pairs, and sin (w;p) is 


an odd function, the corresponding z plane poles also occur in conjugate 
pairs, and the desired characteristic equation is transformed into 


U @~a)=2+ Aw ++: $Aet Ay, (10) 


where each of the coefficients are real. Equating coefficients in equa- 
tion 6 to those in equation 10 gives the required clock frequencies 
91, f1,.--,5n. A negative value implies an associated register which 
counts “down” while a positive value suggests a register which 
counts “up.” 


Vv. FIRST ORDER DIGITAL PLL 


5.1 Static and Dynamic Behavior 
The difference equation for an n = 1 PLL is easily written from 
equation 3: 


* Full duplex transmission with fa = 1070 Hz, f. = 1270 Hz in one band, and 
fa = 2025 Hz, fp = 2225 Hz in the other band. 
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vk +1) — Gyw(k) = 2MF — Gi, (11) 


in which M = 2% and N is the number of counter stages in the 
register. For a steady state condition to exist, v(k + 1) = v(k) so that 


_2MF-—G, _ 2Mf-— 9g 
ae I1-G fim a 


Since 0 S v(k) S 1 the end points of the lock range are given by 


f | scxy-0 =fi= g:/2M ; f | het = fu = f,/2M. (13) 
And so the lock range is 


af. = Ite — hl = sazla— fl. (14) 


The static response is diagrammed in Fig. 3. 
The dynamic response to a step change in frequency is given by 
equation 6 forn = 1. 


2 —-G]V@ = 2MF — eq 25 + 2v(0). (15) 


A partial fraction expansion yields 


ro = ell 
+ [oF SA a) os 


ir ~—LOCK RANGE at 
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Fig. 3— Average output voltage versus input frequency, for first order PLL. — 
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Assume that for time t < 0 the normalized input frequency is Fy. 
At t = Othe input steps to F,. Then from equation 12: 


2MF, —G 


Substitution into equation 16 gives the z transform of the response 
fort = 0 


vo = eS a]+ Ate a) «9 


The inverse transform is easily found with the aid of equation 4a. 


(17) 


2MF,—G, 2M(F, — F.) 


IS aan i-G, (Gy | 
_ 2Mf,—g 2M (fs — fa) (gr\* 
er aa (2) for k = 0. (19) 


Assuming that v(k) is a continuous function of time, we let v(k) = 
v(t) and t = kp = k/2f,. This gives 


SMe oe 2M St) ei olin Gale. 20) 


j= 
se hi-% ho 
The resulting time constant is 
ge eee (21) 
2f, In (f:/g:) ’ 
and thus the half-power bandwidth is 
I 
= 7 fo In (f1/g1). (22) 


Notice the dependence of filter shaping upon input f,. It follows that 
there is a somewhat different time constant for input frequency 
changes from f;, to f,. 

A simplified first order digital PLL has been built as shown in Fig. 
4, It is a special case of the general system of Fig. 1. The FM input 
was composed of the discrete frequencies f, = 1070 Hz and f, = 1270 
Hz. Clock frequencies g; and f; were chosen as (2M) (970) Hz and 
(2M) (1870) Hz, respectively, to provide a lock range of twice the 
total input frequency deviation. M was chosen to be 128 (see Section 
5.3). The above values gave a time constant, 7’, of 1.13 ms. Inter- 
changing f, and f, resulted in an increased time constant of 1.34 ms. 
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Fig. 4—Block diagram of simplified first order PLL FM discriminator. 


Oscilloscope photographs showing the two step responses of the actual 
loop are shown in Fig. 5. 


5.2 Out-of-Lock Oscillation 
If there is no input to the PLL, it will run at a rest frequency of 





M uy" 1 sgh 
S(t) =e : 23 
Greer Mo +h 28) 
Notice, however, that one half cycle will be at g,/2M and the next 


at fi/2M. 

If an input to the PLL is present, but its frequency lies outside 
the lock range, that is, if f < f,/ 2M or f > g,/2M, the output voltage 
v(t) will oscillate between 0 and 1. Figure 6 shows waveforms for a 
loop which is out of lock. For time to the left of the dotted line, the 
loop is attempting to lock and the frequency of the feedback signal, 
E,, is approaching that of the out-of-lock input #. Within this region, 
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(b) 


Fig. 5— Output voltage response of first order PLL to input step frequency 
of (a) 1070 to 1270 Hz and (b) 1270 to 1070 Hz. Horizontal scale: 0.5 ms per cm. 


equation 16 applies and is rewritten in the time domain as 
2Mf — a) i 2Mf — a |(ay 
= (—4—4 ~S | 2) 24 
a ( h-% aa Lee hii-— fh Se 


‘For time to the right of the dotted line in Fig. 6, a new difference 
equation applies. It is written as 


gloj) ~ 7G +1) + fir) = M (25) 


so that a derivation similar to that previously used results in 


oo ~ [2-2] (0) -1J+0(8). on 


Equations 24 and 26 are represented graphically in Fig. 7. M, gi, and 
fi take on the values of the previous example and an out-of-lock 
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Eo | 
> {kk >| —— | le--->| |e--s| 
rT(k) 7 (kK+1) T(j) 75+) 
GATED | 
clock f, g, f, f Qh f 


Fig. 6— Input, feedback, and output waveforms of nonlocked first order PLL. 


input frequency f = 1400 Hz is chosen as the input. The positive 
slope segment to the left corresponds to equation 24 with an initial 
voltage v(0) assumed to be zero. When v(k) reaches unity, the next 
segment is governed by v(j) with an initial condition v’(0) = 1. 
When v(j) reaches zero, the response is again given by v(k) and with 


1,0 


° ° ° 
ES a @ 


AVERAGE OUTPUT VOLTAGE , V 
° 
iw) 


TIME, t —> 


Fig. 7—Single cycle of output average voltage oscillation for nonlocked first 
order PLL. 


2220 THE BELL SYSTEM TECHNICAL JOURNAL, DECEMBER 1968 


—V=i 





Fig. 8— Actual output voltage of experimental first order PLL showing out- 
of-lock oscillation. Horizontal scale: 1 ms per cm. 


initial condition adjusted accordingly. The actual waveform for the 
experimental PLL using this example is shown in Fig. 8. 


5.3 Internal Noise From Phase Discontinuity 


The clock signals are unsynchronized, and thus a random phase 
discontinuity results at the time of gating. This results in an internal 
noise voltage which exhibits itself as a fluctuation in the output voltage. 
This effect can be thought of as a quantization noise of the phase. A 
portion of Fig. 2 is redrawn in Fig. 9 to show the time errors €9, ¢}, and e 
at the instant of gating caused by differences in clock phases. It is 
assumed that the frequency of the ripple is low enough so as not to be 
attenuated by the PLL filtering property. With this in mind, difference 
equation 11 may be amended to account for the phase error. 


Ir(k + 1) — a(k + U)If 
+ [p(k) — r(k) — e(k) — &()l +2= M. (27) 


For a constant input frequency and very slowly varying 7r(k), 7(k + 1) = 
t(k) so that 


7k) [hh — gi] = M — gp + fralk + 1) 
+ gileo(k) + @3(k)] — 2, (28) 
or 
[o(k)) [fh — gu] = 2M — gy + 2ffia(k + 1) 
+ 2fgileo(k) + g(k)] — 4f. (29) 
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Avy is defined as the difference in voltage for maximum and minimum 
(zero) phase errors. Thus 


Avy = | L_| [frer(& + 1) [max + gileo(A) |max + €6(K) |max)]. (80) 


Since 


Aba DS ae lace = ~ 


1 
hi 
then 

Gf 
Am fi—- gm’ 
where f is approximated by the carrier f,, for the narrowband case. 





Avy = 


(31) 





Phen 


CLOCK 


CLOCK | 

f; 
oS 
Eo(K) e%{k) 


Fig. 9—- Waveforms for locked first order PLL showing how time jitter is 
caused by phase differences in clock signals. 


2222 THE BELL SYSTEM TECHNICAL JOURNAL, DECEMBER 1968 


It may be seen that the peak-to-peak output for an incoming FM 
wave with phase continuity in the clock signals is given by equation 
12 as 
Asie a 2M, = 91 — 2Mf, ry 91 cs 2M Af 
h—- 9 hoi ae |e 
where Af is the total frequency deviation | f, — f,|. Thus the mini- 
mum peak-to-peak voltage signal-to-noise ratio within the loop is 





(32) 


— Arsig _ 2M(Af) _ ow Af, 

S/N = ie 2 Gf, (33) 
Continuing the previous example with Af = 200Hz and f, = 1170 Hz, 
it is desired that the internal noise be 20 dB below the signal so that 
2” > 361. This suggests a nine stage counter (2° = 512); however, it 
should be pointed out that a worst case of phase jitter has been assumed 
and the average phase jitter is less than this. Laboratory experiments 
have shown satisfactory results with an eight stage counter thus giving a 
minimum signal-to-noise ratio of 17.2 dB. 

It is apparent from equation 33 that the internal noise may be made 
as low as desired by choosing a sufficiently large value of N. Thus a 
direct relationship exists between equipment cost (number of counters) 
and performance (jitter distortion). 


VI. SECOND ORDER DIGITAL PLL 


The location of the desired s-plane poles must be specified and mapped 
into the z-plane. The following example is concerned with synthesizing 
the familiar Butterworth response, but the procedure is certainly 
applicable to other filter classes. 

A second order PLL is to be synthesized so that its response is that of 
a Butterworth low-pass filter with cutoff at w,. The 2n = 4 poles of 
H(s)H(—s) lie on a circle of radius w, and subtend equal ares such that 
the filter’s characteristic equation is 


8’ + (2)'a.8 + w? = 0, (34) 
with poles at 
Si.0 = oy (-} + i), (35) 


The corresponding poles in the z plane are 


21,2 = exp (81,29) = [exp (—a)] [cos (a) -& i sin (a)] 
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WT me 
a direre Gian eee=e aw te 36 
OF = Oy we 
Digressing for a moment, it is of academic interest to map the entire 
Butterworth circle into the z plane. This is easily done by letting s be a 
circle of radius w,, that is, s = w, exp (#6), 0 S @ S 2r. Substituting 
z2 = exp (sp) and approximating f by f, gives 


plas {esp |g cos  |Keos (2¢si ) + 7sin (2-5 
(37) 


The resulting cardioid-like shape is shown in Fig. 10 with fc/fm as a 
parameter. The angle ¢ is also shown so that portions of the Butter- 
worth circle may be conveniently translated into the z plane. 

Returning to the specific example, the characteristic polynominal is 
written using equation 36 as 


(2 — 2)(2 — 2) = 2° — [2 exp (—a)] [cos (a)Jz + exp (—2a). (88) 
The resulting coefficients are equated to the respective coefficients in 
equation 6, namely 2? + F14z—G so that: 














Fy = fi/f, = —2[exp (—a)][cos a] (39) 


and 


Gi = 9:/f. = —exp (—2a). 
A third equation is arrived at by fixing the steady-state voltage v(k) 
for a specific input frequency, f. The choice is arbitrary, and since the 
system’s input spectrum is symmetrical about the carrier frequency, 
fm, 1b is reasonable to fix the corresponding output voltage at 0.5. The 
equation is obtained from equation 6 by noting that steady-state im- 
plies periodicity so that v(k) = v(k+j). Thus: 


2 
Solving equations 39 and 40 gives 
2M fm exp (a) 


fs = cos (@) — sinh @) ’ ou 
_ 4M fn cos (a) 
h = cos (@) — sinh @) ’ Se 
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Fig. 10 — Mapping of the s-plane Butterworth circle into the z-plane. 


and 


2M fm exp (—a) 


91 ~~ Gos (2) — sinh (a) 


(43) 

As a practical example, consider the demodulation of a narrowband 
FSK wave whose spectrum is centered about a carrier of 2125 Hz. The 
signaling rate is limited to $ 300 baud so that a cutoff frequency of 250 
Hz should prove adequate. As previously noted, the choice of M is de- 
pendent upon the maximum baseband jitter (quantizing noise) which 
can be tolerated. This must be weighed against the added circuitry and 
higher clock frequencies imposed by large values of M. A convenient 
choice is M = 128. Thus 

mw fe 


a= 0) j,, = 0.262 
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so that 
Qi = 598 kHz, 
fi, = 1504kHz, 
and 
fe = —1012 kHz. 


As previously mentioned, the negative value for f. requires a “down 
counter” for the second register. This example is continued in sub- 
sequent sections. 


6.1 Step Response of the System 


Once the coefficients gi, f1,..., fn are determined, the characteristic 
equation is uniquely specified and the system’s time response to a 
step in input frequency is easily found. The step response of a second 
order system is now derived in the interest of providing further in- 
sight into the characteristics of the loop. 


Equation 6 is written for n = 2 as: 


[2 + Fiz— G|V (2) 
= (2MF — ey(—2-5] + vz + (0)2 + 2Fw(0). (44) 


Assume that the input frequency steps from F,, to F at t = 0 (as before, 
capitalization denotes normalization so that F; = f;/f., G; = 9;/f2). 
Thus for t = 0 we have 


Z 
Vie) = (MF — let ae 


2 
z 


a 10)| 2 =| + [Fiv(0) + Wl + ae = =|: (45) 


The initial conditions v(1) and v(0) are determined from the differ- 
ence equation 


v(k -+ 2) + Fiw(k + 1) — Gyv(k) = 2MF — Gy. (46) 


Assume that the system is in steady state for ¢ S 0 so that v(k) = 
vk —1),k $0. Fork = —2 , 


v(0)(1 + Fi — G] = 2MF, — G, (47) 
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so that 
=? 2MF, — G, : 
Now let k = —1: 
v1) = 2MF — G, — [F; — GJ (0), (49) 
which may be written as 
v(1) = (0) + 2M A, (50) 
where 
A= [F — F,]. (51) 


Substituting equations 50 and 51 into 45 and simplifying leads to 


IMA 
MG [oo + | 
= 2M A I= Z | 
1+ fF, — Guile + fe HG 


2M AG, Zz 
= | 2847 | | oO) 


The following substitutions are made so that V(z) may be easily 
transformed. Let 





F, = —2[cos 6] exp (—a); G, = —exp (—2a). (538) 


Substituting and rearranging gives 


ve = [00 + alll 


_ | 2MA || 2’ — 2(cos B) exp (—a) | 
1+ F, — G,JL2’ — 22(cos 8) exp (—a) + exp (—2a) 


4AM AG, _ -} 
= ( ay ee aa P, (4G, M) | 


2(sin 8) exp(—a) | ; 
Pome ol ee (5 


This is transformed into 











008) = | 000) + ee fue 
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— exp (—aip| (284 cos (Bk) 


where u(k) is the unit step. The time response is found by again assum- 
ing that v(k) is a continuous function of time v(Z), k ~ 2ft. Substitution 
gives the familiar response of an underdamped second order system: 


2MA 
v(t) = |» is ett uo 
2MA 
~ exp (-24t0| (FR) om a! 
4AM AG, -F)- oe |. 
a (, aR; _ G, Py ( 4G, I) sim (26ft) (56) 
The experimentally determined step response of the actual n = 2 


PLL closely approximates the Butterworth response and is shown in 
Fig. 11. 


6.2 Lock Range for the Second Order System 

As was the case for the n = 1 PLL, the loop is said to be in “lock” 
if, for a steady input frequency, the output v(k) is constant. Although 
there are many frequency ranges for which the loop exhibits this 
property, there is a “fundamental” lock range over which v(k) varies 
linearly with frequency between its defined limits, and the output 
frequency equals the input frequency. In steady state, v(k+2) = 
v(k+1) = v(k) so that from equation 40: 


OMF —~G, _ 2Mf— 4, 





= = i < < 
v(k) coe =O. Leo if. 0 S vk) £1. (57) 
The end points of the lock range are 
flemeo =f. = YT: (58) 
and 
floa=fh= Ak. (59) 


The lock range is given by 


Afi =|t—hl=srlachokh (60) 
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— f, =2025 HZ 


fi = 2225H2Z 





- 


Ff, =2025HZ —f§ 


seven gums 
Hy 
fe] i 
P| | 
an i 
. ae eae 
a 
S| ~ fg? *: 
" a4 


—f, =2225 Hz 


sa 


= 





Ss 


Fig. 11— Output voltage response of second order PLL to input step fre- 
quency of (a) 2225 to 2025 Hz and (b) 2025 to 2225 Hz. Horizontal scale: 1 ms 
per cm. 


Substituting the values calculated previously gives a lock range of 
414 Hz which is approximately twice the bandwidth required. The 
useful bandwidth is reduced still further as will be shown. 


6.3 Bound on Cutoff Frequency 
For a step in frequency f,—f, we have a corresponding step in volt- 

age, 

— 2Mj.— 9 _ 2Mf— m1 _ 2M (fa — fr) 
fth—-% f+th—-—n bPth—- 


Av is defined to have a maximum value of unity so 


fe t+h—7 2 2M(fa — fr) (62) 


Av (61) 
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and 


< 2M 
—- 1 
If f, and f, are symmetrically distributed about the carrier frequency, 
fe, we can let v = 1% at f; = fm so that from equation 40 


1+ F, — G, Ga — fs) for fe <0. (63) 


4Mfn 





THA +G i 
Substituting equation 64 into 63 gives 
1+h-Gsa+n+a(ee h). (65) 


Substituting equation 53 for F; and G; gives 
1 — 2[exp (—a)] cos B + exp (—2a) 


< (Graal — 2(cos 8) exp (—a) — exp (—2a)). (66) 
The constants « and £ are related to the filter shaping desired and are 
a function of the input frequency for ¢ > 0. For the case of a Butter- 


worth response (n = 2) 


sqgeesalhy: 

aaa ame) (f) 

For a narrowband system the input frequency may be approximated 
by the carrier f,,. Thus 


1 — 2(cos w) exp (—u) + exp (—2y) 





S [1 — 2(cos yw) exp (—x) — exp (—29| | » (67) 
where 
_ 7 fe 
oO. 
This may be written as 
jn + 
exp (u) — 7 xP (—2) 


ta aie 2 
cos p = aa aa (68) 
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Thus for a given f,, f, and fm, the cutoff frequency f, is bounded. For 
example let f, and f, be 2025 Hz and 2225 Hz, respectively, and be 
symmetrically distributed about the carrier f,. Then 


eee SP (u) + ae exp (~H) | 69) 
Solving this transcendental equation gives | 
pw = 0.19 (70) 
and 
f. = 182 Hz. 


An upper bound on f, is found by requiring that the end points of 
the lock range be positive frequencies. The lower edge of the lock 
range is required to be greater than zero so that from equations 59, 
41 and 42 


bth s9 or 2 cosu — exp (u) = 0. - (71) 


This is satisfied if » < 0.54 which gives positive value for equation 71 
as well as for the upper edge of the lock range g,/2M. Thus 0.19 S$ pS 
0.54 so that for f,, = 2125 Hz 


182 < f. S 542 Hz. (72) 


The bounds on f,/fm are loose in the sense that it is assumed that the 
full lock range is available to input frequencies. This is not the case 
for underdamped systems, where a hysteresis effect known as “cap- 
ture” reduces the effective lock range available, and so the bound 
might be tightened accordingly. 


6.4 The Capture Phenomonen Apa 

When the PLL is “out of lock” its output voltage oscillates be- 
tween 1 and 0. The “pull-in” frequencies are those frequencies furthest 
removed from the carrier for which the PLL will ultimately lock. In 
general, the PLL will exhibit a hysteresis so that’ the pull-in range 
will be smaller than the lock range; also, the upper and lower pull-in 
frequencies will generally not be symmetrically distributed about the 
carrier f,,. In general, solution of the capture range (for conventional 
phase-locked-loops) results in a nonlinear integrodifferential equa- 
tion. Solution for conventional PLL’s requires phase plane techniques 
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and is documented in Ref. 2. An analysis to determine capture range 
for the digital PLL has not been performed. 


6.5 Stability of the Second Order System Within the Lock Range 


Thus far there has been no restriction placed on N, the number of 
counters required per shift register. For N less than some critical 
value N,,, it is possible for stable modes of operation, different from that 
of Fig. 2, to exist. These modes do not exploit the PLL to its fullest 
advantage and thus in previous derivations it has been assumed that 
N 2 N,,. 

A possible parasitic mode is shown in Fig. 12. The input frequency 
is constant, as is the frequency of the flip-flop signal, E;, but the 
waveform of H; is no longer square. This operation is caused by the 
limited length of the registers. Assume that the lengths (number of 
counter stages) for the n = 2 system are equal, and of value N. Each 
time the feedback flip-flop (see Fig. 1) is triggered, register 1 is reset 
to zero. During the period 7,+7. (see Fig. 12) this register is ad- 
vanced to a count of [T1g1+Te2f1] modulo C where C = 2%. This num- 
ber is shifted into the second register where one of two conditions may 
exist: 


Tig, + Toft AV, 


ee 


Eo 


sl etal be | 
sl te ke 


Fig. 12 — Waveforms for parasitic locked mode of second order PLL. 
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or 


Ti9: + Tefi S M 


The first equation results in normal operation, analyzed in previous 
sections. During 7’, the count is reduced by a negative fe so that 


[T191 + T of, + T fel mod C = M. 


The second equation requires that the second register count down to 
zero (which is congruent to C) and then further reduce the count by 
C — M. Thus 


[7191 + T of, + T sfo] mod C — M — Cc. 


These equations may be combined by introducing the parameter § = 
—1, 0 depending upon the mode of operation: 


Ti: + T of, + T sho = M = 6C. (73) 


Similarly, an equation similar in form to equation 73 covers the in- 
tervals T., 73, and T,: 


Ts. + Taf. + Tofe = M — ¥C, (74) 
where y = 0, 1. 
The period, 72, may be explicitly solved for by noting that 


1.+T,= T, + Ts; = p. (75) 
Thus 
a er 
: p © i ae —h 
ro 6C + Mf — (-v8 + Wh — 9) 
Se eg 76 
fi~ G2 — g:)° j 76) 
Under normal conditions 6 = y = 0 so that 
_2Mf — ~ 9i_ 
2 = 
hh + fe a gi’ 
which is identical to equation 57. The parasitic mode of operation is 
given by 6 = —1, y = 1 so that equation 76 may be rewritten as 
2Mf — 9 2Ci 
= ee 7 
a ofc ee 7) 


Graphical examination (see Fig. 13) of equations 57 and 77 shows that 
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OUTPUT VOLTAGE, V => 





fitfe 3, 
2M 2M 
INPUT FREQUENCY, f —> 


Fig. 13 — Comparison of parasitic and normal mode voltage versus frequency 
characteristic for second order PLL. 


two modes of operation may exist within the lock range. To insure that 
this will not happen, C must be chosen so that the frequency range for 
which 0 S v2 S 1 lies outside the normal lock range. Thus at the upper 
lock range edge, f = g,/2M, it is required that v. > 1 so that from 


equation 77, 
20( 2) 
2M >] 


fy or = Jo : 


Hence 


C fit fi fe 
—s > a 
M 91 
and thus the minimum number of counters required is 


N,, = 1 + integer value | toe (tte t)ae | (78) 


VII. PERFORMANCE 


In a laboratory test, an FM digital signal with mark and space fre- 
quencies of 1270 and 1070 Hz was fed into ann = 1 PLL which had 


* That is, the integer value of 23 is 2. 
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an 8-stage counter and clock frequencies for which g, = 2M(970)Hz 
and f, = 2/M(1370)Hz. The output of the exclusive-or was fed to an 
amplifier which clipped the signal to precise levels and provided a con- 
stant output impedance. The signal then went to a Butterworth low- 
pass filter with n = 3 and cutoff of 200 Hz. The filtered signal was sliced 
to give a digital output which could be compared with the original 
digital input. The FM modulator used in these tests was from a Bell 
System Data Set 10311. The eye pattern for this circuit with an input 
at 300 baud is given in Fig. 14a. When the lock range was increased to 
+300 Hz about the carrier, the eye pattern improved, as shown in Fig. 
14b. This effect results from the increased bandwidth, f, . 

Start-stop distortion, as measured with a Bell System 911A data test 
set for bit rates of 150 to 300 baud, was 2 and 5 percent, respectively. 
Performance with additive gaussian noise is as shown in Fig. 15, 





‘(b) 


Fig. 14—-First order PLL eye patterns for 300 baud random digital data. 
Lock range of (a) 400 Hz, (b) 600 Hz. Horizontal scale: 0.5 ms per cm. 
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START -—STOP DISTORTION IN PERCENT 
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SIGNAL- TO- NOISE RATIO IN DECIBELS 


Fig. 15— Distortion performance of first order PLL with input signal de- 
graded by 3 kHz band-limited white noise. 





(b) 


Fig. 16— Second order PLL eye patterns for (a) 150 baud and (b) 300 baud 
random digital data. Horizontal scale: 1 ms per cm. 
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where start-stop distortion is plotted versus signal-to-noise ratio. The 
noise is 3 kHz band-limited white noise and the bit rate is 300 baud. 
This performance is similar to that of the receiver in Data Set 103E1. 
Start-stop distortion for a single frequency (2025 Hz) interference 
was measured and, as might be expected, single frequency interference 
is a strong function of frequency, the worst values being at odd multi- 
ples of the channel frequency. 

Laboratory tests indicated that the second order PLL, with char- 
acteristics described in Section VI, performed in accordance with 
theoretical expectation. The eye patterns for an input of a modulated 
random data signal at 150 and 300 baud are shown in Fig. 16. It was 
found that noise performance of the second order system was not 
much better than that of the first order system. This probably resulted 
from the counter length falling just short of the value indicated in 
equation 78, thereby allowing noise perturbations to randomly shift 
operation between the stable and parasitic modes of operation. 


VIII. CONCLUSIONS 


A synthesis procedure for the realization of an nth order digital 
phase-locked loop has been described. Such systems find application 
in FM demodulators, filters, and in extremely stable locked oscillators. 
Various loop properties have been theoretically derived and experi- 
mental performance has been found to be consistent with these results. 
The advantages of such circuits for use in large multichannel data 
sets are: 


(t) Filtering property—(6n) dB per octive. 

(wz) Small size—completely integrable using one or more mono- 
lithic chips. 

(wiz) Requires no adjustment—permitting lower manufacture and 
repair costs. 

(iv) Excellent stability and reliability—the PLL circuit either 
works or does not, since it is completely digital. Stability of the en- 
tire system is dependent upon clock stability which may be as good 
as required. 

(v) Multichannel economy—accurate clocks can be used to drive 
many channel circuits. 

In addition, this circuit has two inherent advantages over other 
types of phase-locked loops. First, it requires no low-pass filter gen- 
erally found between the phase comparator (multiplier) and voltage- 
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controlled oscillator in conventional phase-locked loops. Second, it 
includes, in effect, an ideal voltage-controlled oscillator, the frequency 
of which is linearly related to voltage. 
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First and Second Passage Times 


of Sine Wave Plus Noise 


By A. J. RAINAL 
(Manuscript received June 25, 1968) 


This paper is concerned with the first and second passage times of a 
stationary random process, I(t, a), consisting of a sinusoidal signal of 
amplitude (2a)* plus stationary Gaussian noise with a finite expected 
number of zeros per unit time. This type of random process is present 
at the output of the IF amplifier of a radio or radar receiver during the 
reception of a sinusoidal signal immersed in Gaussian noise. Approximate 
integral equations are developed whose solutions yield approximate proba- 
ability densities concerning the first and second passage times of I(t, a). 
The resulting probability functions are presented in graphs for the case 
when the frequency of the sine wave is located in the center of a band of notse. 
Related results concerning the approximate distribution function of the 
absolute minimum or absolute maximum of I(t, a) in the closed interval 
[0, 7] are also presented. 


I.. INTRODUCTION 


Exact, explicit, results concerning the first passage times of a Markov 
or ‘“‘Markov-like” random process have been given by many authors.*~” 
But very little is known about the first passage times of a random 
process consisting of a sinusoidal signal plus stationary gaussian noise. 
This random process is of interest because it serves as a realistic model 
for the output of the IF amplifier of a typical radio or radar receiver 
during the reception of a sinusoidal signal immersed in Gaussian noise. 

Let Z(t, a) denote the stationary random process consisting of a 
sinusoidal signal of amplitude (2a)? and angular frequency gq plus 
stationary gaussian noise, J y(t), of zero mean and unit variance. Thus, 


I(t, a) = (2a) cos (gt + %) + In(t). (1) 


% denotes a random phase angle which is distributed uniformly in 
the interval (—7z, 7). a denotes the signal-to-noise power ratio. 
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The first and second passage times of J(é, a) are indicated in Fig. 1 
and are defined as: 


(i) r* represents the time I(t, a) takes in going from an upcrossing 
of the level J, to the first crossing of the level J, < I, . 

(iz) +” represents the time J(é, a) takes in going from a downcrossing 
of the level J, to the first crossing of the level J, < I, . 

(zi) r+ represents the time J(f, a) takes in going from an upcrossing 
of the level J, to the second crossing of the level J, < I, . 

(zv) 7 represents the time J(é, a) takes in going from a downcrossing 
of the level J, to the second crossing of the level J, < J; . 


For fixed J, , Iz < I,, and a we denote the probability densities of 
ag 7,7}, and 7, by Wr, I,,I,,a),W (7,1, , 12,4), Wir, I, ,I,,4), 
and Wi(r, I,, Iz, a), respectively. These four probability densities 
arise in many branches of science and technology. 

Because the random process I(t, a) is symmetrical about its mean 
value of zero, we need only discuss the case when J, > J, as is indicated 
in Fig. 1. The case when J, < J, can always be converted to the case 
under discussion by considering the random process —J(t, a). 

For the general process I(t, a), exact, explicit expressions for the four 
probability densities are unknown. However, as already mentioned, 
exact, explicit results concerning the first passage times of I(t, a) are 
known for the special cases when a = 0 and Iy(é) is a Markov or Markov- 
like random process. 

The purpose of this paper is to present theoretical approximations for 
the four probability densities of the first and second passage times of 


$i AND 6, ARE AXIS-CROSSING 
| INTERVALS AT LEVEL I, 
| — 210, e-- 


LEVEL I, 


I(t, a) 





Fig. 1—7+* and + are the first passage times of I(t, a) = (2a)1/2 cos (qt + 60) 
4 Iy(h) defined by the levels Li and Jz. Similarly, 7:* and 7+ are the second 
passage times. 
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I(é, a) for the cases when a > 0 and Iy(¢) is a stationary, gaussian process 
having a finite expected number of zeros per unit time. 


II. AUXILIARY PROBABILITY FUNCTIONS 


Using a notation consistent with Ref. 8, we define the following 
auxiliary probability functions concerning the stationary random 
process I(t, a): 


(t) P3"(7, I, , Iz , a)dr, the conditional probability that a downward 
crossing of the level J, occurs between ¢ + 7 and ¢ + 7 + dr given an 
upward crossing of the level J, at ¢. 

(it) Py*(7, I, , I2, a)dr, the conditional probability that an upward 
crossing of the level J; occurs between ¢ + 7 and? +7 -+ dr givena 
downward crossing of the level J, at ¢. 

(tit) P >~(r, 1, , Iz , a)dr, the conditional probability that a downward 
crossing of the level J, occurs between ¢ + 7 and? + 7 + dr given a 
downward crossing of the level J, at t. 

(iv) P3*(r, I, , I2, a)dr, the conditional probability that an upward 
crossing of the level J, occurs between t + 7 and t + 7 + dr given an 
upward crossing of the level J, at ¢. 


These auxiliary probability functions were given in Ref. 8 for the case 
when J, = J,. Here, we need to merely generalize to the case when 
I, # I,. The reader should see Rice’s work for the definition of all 
notation which is not defined in this paper.* When a > O and J, ¥ I, 
Rice’s equation 38 generalizes to: 


Pz (r, I, ’ I, ’ a) 


T ve) 0 
=p, f af anf anno, u.b.n) @ 
—* 0 —o 
where 


N;, = Rice’s equation 2.7 of Ref. 9 for the expected number of up- 
crossings (or downcrossings) of the level J, per second 


_ (6)? exp (~Ii/2) y= D"Gn)! 1)"(2n)! (a (2)" 


Qr n=0 2"(n 1)? 


1 i 1 A 
-F(—n LI ie Ti) p(—4 >” + ie —2f) 


if, = the confluent hypergeometric function 
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pd, , If, 15,1.) = (22) °M 
“exp {sh [MoA(T?? + 14?) + 2Mor TT, + 2D,0i + 2H1, + ra} 


Mos 4a 43 
Mo Q oi (2 a) 


D, = M,,(1, — Q cos 6) + M,,[Q cos (qr + 0) — 12] 
+ M,..Qqsin 6+ M.,Qq sin (qr + 86) 
E, = M,.{Q cos (qr + 6) — I.] + M,3(1, — Q cos 4) 
+ M..Qqsin (qr + 0) + M.3Q¢q sin 0 
F, = My {Ii + I2 — 2QU, cos 6 + I, cos (qr + 4)] 
+ Q’[cos’ 6 + cos’ (qr + 6)]} 
+ 2M,.Qq{U, — Q cos 6] sin 6 + [Q cos (gr + 0) — I,] sin (gr + 86)} 
+ 2M,;Qq{{I, — Q cos 6] sin (qr + 0) + [Q cos(q7r + 6) — I] sin 6} 
+ 2M,,.[7, — Q cos 6][J2 — Q cos (qr + 4)] 
+ Mzz(Qq)'lsin’ 6 + sin’ (qr + 6)] + 2M@22(Qq)’ sin @ sin (qr + 6). 
The M’s are given in Ref. 4, Appendix I with 


i= 


He -= : "GY ad eay. (3) 


where W(f) = one-sided power spectral density of J,(é). Also, 
B = —m’’(0). The primes denote differentiations. 


Equation 2 can be put in the form: 
Py(r, qy ’ i, ) a) = [40°N 7.) °M22(1 = m7? 
[exp (—G/2M) Je, hy ba) 9) 
where 
an 1 he He - a : 
Jr, hs , ks) = mi — 7 J; dx . dy(x — hs)(y — ks)e 


ety — nay 
2(1 — 7) 


ae 
he = Mza(l — ri} [Ds — rb | 


22 


.= 
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i 3 
ks = —M>{1 ey Fil. [Es = Da =| 


Mo» 
G, = M3,(1 = i (2r, DH — D oe E3] -- F, . 


Py* (7, I, , Iz, a) is obtained from equation 2 by changing the signs 
of the ’s in the limits of integration. We find that P;*(r, I, , 2°, a) is 
equal to the right side of equation 4 with hs , ks replaced by —h3 , —kz . 

Py (7, I, , Iz, a) is obtained from equation 2 by changing the upper 
limit of integration of I/ to — ©. We find that PZ" (r, I, , Iz , a) is equal 
to the right side of equation 4 with the function J(7, , hs , ks) replaced 
by the function J,(7; , hz , ks), where 


1 —o ao : 
Tits sha sks) = qr fae | dy @— hy — ke) 


PZ*(r, I, , Iz, a) is obtained from equation 2 by changing the lower 
limit of integration of If to -+- ©. We find that Pi*(r, I, , I. , a) is equal 
to the right side of equation 4 with the function J(r, , h3 , k3) replaced 
by the function J,(r, , —h; , —ks). 

The functions J(r, , h3 , ks) and J,(r, , hs , ks) are expressed in terms 
of Karl Pearson’s well-known tabulated function (d/N) in Ref. 10. 

A considerable simplification occurs when a = 0. For this case we find 


Pz (r, I, ’ I, ’ 0) = M,.67*(1 as m?)? 
2 2 2 
emp |B — GAR — Bahl) ly, 3,,%) (6) 


2 211 — m’) 
where 
h = —m' (ml, Tr I.) 
: [M2.(1 — m)} 
ks Bs m’'(L, = mI») 


[M.(1 — m)]* 


Equation 6 reduces to Rice’s* equation 47 when J, = I, = I. 
III, APPROXIMATE RESULTS VIA INTEGRAL EQUATIONS 


3.1 Probability Densities 


Let us assume that each of the random variables 7°, 7, 7%, and rj 
(see Fig. 1), is statistically independent of the sum of the following 
(2N -++ 2) axis-crossing intervals at level J, when N = 0,1, 2, --+ . Under 
this ‘‘quasi-independence” assumption, approximate theoretical results 
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for the probability densities of r*, 77, 7%, andr], namely W*(z, [,,I2,a), 
W (7,1, ,l2,a),W%(7, tJ, , I, ,a), and Wy(7, I, , I. , a) are given by the 
following integral equations: 


PGs hg dee) = WG Tip 1 
+ W'r,1,,1I2,a* P27 (,I,,1I.,4@ (7) 

Pr(7,1,,l2,@ = W(s,1,,l2,@ 
+ W(r,1,,1,,a) * Pr (7,1, ,12,4@) (8) 

ght ding dog). = Wid 98) 
+ Wir, 1,,I2,a * P27, 1, ,I2, 4) (9) 

Pe G44 135 @ = Wiles de 133) 
+ Wir, 1, ,I2,a)* Pr*(7, I. , Ie, a). (10) 


The P.’s are the auxiliary probability functions presented in Section 
II, and the symbol * denotes the convolution operator, that is, 


feg= | Hog — Dat. (11) 
From symmetry we have 
Pr'(r, Ine) = P(r, eee ee) (12) 


Integral equations 7 through 10 are analogous to McFadden’s 
equation 47 and Rice’s equation 84 in Refs. 11 and 4, respectively. 

Let us define two additional probability densities defined by the 
first and second passage times of I(t, a) between the levels J; and Iz: 

(1) W(r, 11, I2, a) dr, the conditional probability that the first cross- 
ing of the level J, occurs between ¢ + + and t + + + dr given a crossing 
of the level J, > I, at t. 

(wt) Wy(+, I1, Iz, a)dr, the conditional probability that the second 
crossing of the level I, occurs between ¢ + 7 and t + + + dr given a 
crossing of the level I; > Iz at t. 

Clearly, we have 


Wr, I, ’ I, ’ a) _ WwW" (7, I, ) I, ? a) i aW (1, I, ’ I, ’ a) (13) 
Wir, I, ) I, ? a) me aWi(r, I, we ? a) a 3W (7, I, ? I, ] a). (14) 


3.2 Absolute Minimum In a Closed Interval 


Let us define the following probability functions concerning the 
absolute minimum of J(é, a) in a closed interval [0, 7]: 
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F*(r, 1, ,I2,@) 
= Pr {min I(t, a) > I, | 10, a) = I, > I, , I’, @ > 0} (15) 


Ostsr 
F(r, I, » Iz , a) 
= Pr {min I(t, a) > I, | (0, a) = 1, > I. , I'(0, a) < 0} (16) 


Ostsr 


F(r, 1, ,I_,a) = Pr {min I(t, a) > J, | 10, a) = 1, > I} (17) 
Ostsr 


where Pr{-} denotes the probability of the event inside the brace. 
Clearly, we have 


Phot) | Witt. bo wae 


(18) 
=1-[ WGL,h,@dr 
0 
aCe hee ee ee / WG i hc ale 
(19) 
z i-f WG Tis Tay de 
0 
Fr, Ty Ln, 0) = [| Wer, 1, 1s, 0) de 


: (20) 
1- | WG te wan 
0 


Because we are discussing only the case when J; > Iz as is indicated 
in Fig. 1, we discuss only the probability functions concerning the 
absolute minimum of I(t, a) in a closed interval [0, +]. The corre- 
sponding probability functions concerning the absolute maximum of 
I(t, a) in a closed interval [0, 7] are associated with the case when 
I, < Ig, and they can be obtained from symmetry by considering 
the random process —I(t, a). 


IV. RESULTS FOR SINUSOIDAL SIGNAL CENTERED IN LOW-PASS NOISE 


For purposes of computation we set the angular frequency, q, of the 
sinusoidal signal in the center of a band of gaussian noise with an 
ideal low-pass power spectral density of cutoff frequency fy. Thus, 


g = who (21) 
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and 
-1 
wy) = { OSf Sho 02) 
0 otherwise. 
Accordingly, from equation 8, 
sin 2rfor 
m ) =e Qrfor (23) 


From equation 23 we see that it is convenient to define normalized 
time aS Up = 2z7for. All our results are plotted with respect to normal- 
ized time Up. 


4.1 Experimental Verification of Auxiliary Probability Functions 


The auxiliary probability functions for the case when I, = J, = O are 
useful for studying the zero-crossing intervals, the axis-crossing intervals 
defined by the level J, = I, = 0, of I@, a). Figures 2, 3, and 4 present 
PZ” , Py* , Pi* , and Pz" for the case when J, = I, = O anda = 0, 1, 
and 4. These results were computed by using Simpson’s rule. The results 
compare satisfactorily with the initial behavior of the experimental 
probability densities presented in Figs. 34, 35, 42, 48, 46, and 47 of Ref. 
12. Notice that the experimental probability densities pertain to the case 
when the power spectral density of the noise is 


7 
1+) 


rather than the power spectral density defined by equation 22. 


Wf) = (24) 


4.2 Results When a = O and a = 1 

Figures 5 through 13 present the results when a = 0, signal absent, 
and l,j = 1, fg = 0; 4, = 1,l, = —1; and I, = 0, Iz = —1. The Po’s 
were computed by using Simpson’s rule, the integral equations defin- 
ing the W’s were solved numerically by using the trapezoidal rule, 
and the F’s were computed by using Simpson’s rule. 

Similarly, Figs. 14 through 22 present the results when a = 1, signal 
present, and 4, = 1, fl, = 0; 1; = 1,[, = —1; andi, = 0, I, = —1. 

As an example of the interpretation of these results, we see from 
Fig. 16 that the median time, r,, for the random process I(t, 1) to 
-go from the level J; = 1 to the level J, = 0 for the first time is given by 


Uo = 20 foTm =f (25) 
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Fig. 2 (also Figs. 3 and 4) — Plots of the probability functions Pa*(w, hh, Is, 
a), Px*(w, Ix, In, a), Patt(w, Li, In, a), and Ps-(uo, i, In, a) associated with 
the crossings of the levels J: and J. by a stationary random process consisting of 
a sinusoidal signal of frequency fo/2 plus stationary gaussian noise with autocor- 
relation function m(7). a denotes the signal-to-noise power ratio. 
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Fig. 3— (See Fig. 2.) 
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Fig. 5 (also Figs. 8, 11, 14, 17, and 20) — Plots of the probability functions 
Ps~(uo, 1, In, a), Wo(o, In, Ie, a), Pa (uo, Lr, In, a), W*(wo, Lr, Is, a), and 
W(u, i1, I2, a) associated with the crossings of the levels J; and Js by, a sta- 
tionary random process consisting of a sinusoidal signal of frequency fo/2 plus 
stationary gaussian noise with autocorrelation function m(r). a denotes the 
signal-to-noise power ratio. 
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Fig. 6 (also Figs. 9, 12, 15, 18, and 21) — Plots of the probability functions 
Py*(w, Ih, Is, a), Wr(w, ih, Tn, a), P3** (uo, ih, Ta, a), W:3* (uw, ih, In, a) and 
Wil(uo, 11, J2, a) associated with the crossings of the levels J; and Jz by a sta- 
tionary random process consisting of a sinusoidal signal of frequency fo/2 plus 
stationary gaussian noise with autocorrelation function m(r). a denotes the 
signal-to-noise power ratio. 
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Ug=2T7 fot 


Fig. 7 (also Figs. 10, 13, 16, 19, and 22) — Plots of the probability functions 
Ft(w, Ia, Ia, a), Fr(w, Lh, Ie, a), and F(w, lL, Iz, a) associated with the 
crossings of the levels J, and Jz by a stationary random process consisting of a 
sinusoidal signal of frequency fo/2 plus stationary gaussian noise with auto- 
correlation function m(r). a denotes the signal-to-noise power ratio. 
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Fig. 9— (See Fig. 6.) 
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Fig. 10 — (See Fig. 7.) 
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Fig. 11— (See Fig. 5.) 
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Fig. 13— (See Fig. 7.) 
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Fig. 15 — (See Fig. 6.) 
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Fig. 16— (See Fig. 7.) 
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Fig. 17 — (See Fig. 5.) 
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Up =27 fT 
Fig. 18 — (See Fig. 6.) 


4 6 8 
Up= 2afoT 


Fig. 19— (See Fig. 7.) 
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Fig. 21— (See Fig. 6.) 
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Fig. 22 — (See Fig. 7.) 


or 


Ta = (20). (26) 


tm also represents the median time during which the random process 
I(t, 1) remains continually above the level Jz = 0 when it starts at the 
level I; = 1. From symmetry, 7, also represents the median time dur- 
ing which the random process I(t, 1) remains continually below the 
level J, = 0 when it starts at the level J, = —1. 


V. CONCLUSIONS 


The exact auxiliary probability functions can be used in approxi- 
mate integral equations in order to deduce approximate probability 
densities of the first and second passage times of sine wave plus sta- 
tionary, gaussian noise with a finite expected number of zeros per 
unit time. These approximate probability densities can be used to 
deduce the approximate median times associated with the first and 
second passage times. Also, the approximate probability densities of 
the first passage times can be used to deduce approximate distribu- 
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tion functions for the absolute minimum or absolute maximum of sine 
wave plus noise in a closed interval. 


VI. 


The corresponding exact results are not yet known. 
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The Spectrum of a Simple 
Nonlinear System 


By S. C. LIU 
(Manuscript received July 3, 1968) 


The random motion of a particle with nonlinear damping is investigated. 
The spectrum of the velocity of the particle ts obtained by solving the as- 
sociated nonstationary Fokker-Planck equation and also by using the 
equivalent-linearization technique. The first procedure yields an exact 
solution in terms of Laguerre polynomials. The second leads to simple, 
approximate results which are valid for cases where the small nonlinearity 
assumption holds. Results obtained by these two methods are compared 
and good agreement is observed over a large frequency range. 


I. INTRODUCTION 


The recent advance in space and communicational technologies has 
led engineers to numerous difficult but fascinating problems in regard 
to the structural dynamics in random environments. For example, 
Hempstead and Lax have investigated noise in self-sustained oscilla- 
tion; ? Ariaratnam and Sanker have studied the dynamic snap- 
through of shallow, arch-type aircraft components under stochastic 
pressure.* In this paper the random vibration of a simple mass with 
nonlinear damping is studied. The nonlinearity of the system is in- 
troduced to linear viscous damping by adding to it an extra term 
which is inversely proportional to the first power of the current ve- 
locity. Emphasis of the analysis is placed on finding the power spectral 
density of the random motion. 

Two different approaches are used to obtain the desired solution. 
First, the exact spectrum is found by solving the associated nonsta- 
tionary Fokker—Planck equation in terms of the eigenfunction expan- 
sion of the degenerate ordinary differential equation. Second, approxi- 
mate solutions are obtained by using the equivalent linearization 
technique by which the original nonlinear system is converted to an 
equivalent linear one. The equivalent linear system, constructed by 
the least mean square error criterion and based on the small non- 
linearity assumption, is then solved by standard linear theory. 
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II. NONSTATIONARY FOKKER-—PLANCK EQUATION 


Consider the first-order nonlinear system described by the follow- 
ing differential equation: 


+ F@) = 7, (1) 


which may be thought of as the velocity equation of a unit mass with 
nonlinear damping F(x) subject to a force f(t). 

Let us discuss the problem of obtaining the power spectral density 
of x(t) when f(t) is a random process. We limit the discussion to 
the case where f(¢) is a stationary white gaussian process with the 
first two moments defined as 


f@) = 0 (2) 
and 
(f(t) f()) = 21s, 6(r1 — te) (3) 


where s, is a constant, the symbol ( ) indicates the ensemble average 
and 6 indicates the Dirac delta function. 

Caughey and Dienes* have investigated a similar problem for 
F(z) = k sgn x. We shall however consider a different case in which 


Fg) = Bx — 7 (4) 


0<2r< ow 


where 8 is a constant and y is a smaller nonlinear coefficient. In case 
y = 0, equation 1 becomes the familiar linear differential equation. 

The Fokker-Planck equation which governs the transition proba- 
bility p(x, | x, 7) with given initial velocity 1, = x(t.) for the velocity 
x(t) at time ¢ is 


‘a OE et a’p 
p= ee | (22 Yo | + 78, Ox? (5) 


where + = ¢ — ¢,. The initial and boundary conditions for equation 
5 are 


Di, = 5 — 2) (6) 
and 


pO, t) = p(~, t) = 0, (7) 
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respectively. As the time of passage t becomes sufficiently large, 
p(X | x, r) in equation 5 approaches a stationary value ps:(x) inde- 
pendent of ¢ and initial condition 6. Setting » = 0 in 5, such a station- 
ary density can be found by solving the degenerate stationary equa- 
tion. The result is: 


Ds = C exp | -+ (B22 — ¥ log x) | (8) 


where C is the normalization factor determined by 


/ er eos (9) 
0 


The power spectrum is the Fourier transform of the autocorrelation 
function which is determined by the joint probability density p(%p, 
x, 7). Thus from the relation 


p(x. > &, 7) a Dii(t,)p (Xo | x, 7); (10) 
we need to find the transition probability density, that is, the non- 
stationary solution of equation 5. Let p(x, | x, 7) = T(t)X(x) in 5. 
It follows that 





T(t) + »T(t) = 0 (11) 
and 
; 2 2, xX r 
ee tEen- SEG) +ax-0 08 


where o? = xs,/B. If X,,(x) is the eigenfunction and X,, the correspond- 
ing eigenvalues satisfying equation 12 and the prescribed boundary 
condition 7, it can be shown that® 


= X m(X)X m(Lp) eg hmtimte) 7 





= 1 
D(x, | w, 7) pa Det(Zo) ( 3) 
In deriving 13, the following orthogonality condition has been used: 
dx 
[ x.@x,@) a te (14) 


Following the transformations adopted by Stratonovich, let p = 4 
(y/78o — 1), 2 = 27/207, and u = z#X. Equation 12 becomes 


G2 tuto Ww), 9 as 
Zz 


au . du | 
az” +r dz +r Zz 
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Equation 15 is a degenerate hypergeometric differential equation which 
has eigenfunctions U,(z) with corresponding eigenvalues A, = 2np: 


U,(2) ea ge LEO (a) (16) 
where 
a pe) aM Za a —Z2 nt a 
Lez) =F1e% ae (e*2"* “) (17) 


is the Laguerre polynomial of degree n. 
Transforming back to the original variables and applying equation 
14, we obtain the following normalized eigenfunctions: 


(2) iia al ies) 
o (ntT(n + Qu + DTQn + DP 


From equations 13 and 10 the transition density and jointly density, 
respectively, can be found as 


X,(z) = (18) 


_ (ayk gat /p~# CO Aca) Acad OT . 
(ee |) 1) = “Tangy Zenlt@ + oe + 1° uo 


and 


2 - 
p(x. y vy 7) = = (zz, yarn (2+29) 


= Le’ (2) LE" (z,) —2n8lr| 





alta + Ou + DPCe FD e (20) 
where 
2 2 
z= ea and 2 = a 


From the above the autocorrelation function R,(7) of x(t), where 
r=t— t,1s 


Rr) = i; i p(x, , x, 7)x,x2 dx dx, 
, en enBi rl 


Dan + 2u + 1)TQu + In! 


Ms 


= Q¢” 


I(u,n) (21) 


n 


1 
Oo 


where 


IGeny | i (g)2## U/ 9-#(g RHE 207 2 (TOM) de dz, . (22) 
0 0 
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In the appendix we show that 
id maak ad Pad) a= —Tz am (8/2)1T|n = (1/2)) (23) 
0 Wr)n! 
from which it follows that 
J Ei + (8/2)|P{n — aay (24) 
4a 


I(u,n) = 7! 


Substituting 24 into 21, we obtain the following expression for the 
autocorrelation function R,(r): 








os eee Zt (3/2)] Tin = (1/2)Ie —2nBlr\ 
R.(r) = TQn +1 > @)T@+m@+p? ©) 


which is a monotonically decreasing function of r. 
The power spectral density of x(t) can be derived from equation 
25 as the following: 








se) = = [ ; Ree eae oT + ee 5(.s) 
o\ T'[2u + (3/2) T{n — (1/2)] sn - 
ce) P(2u + 1) ya (n!)?P(n + 2u + 1) 4n7B? + w* (26) 


Notice that S,(w) is again a monotonically decreasing function of 
w and has a spike at w = 0. 
The nonstationary mean value of x(¢) is given by 


(x(7)) = : apis | esa) aa: (27) 
0 
Using equations 19 and 23 it can easily be shown that 


oT (2u + (3/2)] & In — (1/2)]L2" &,) pooner 
(a(r)) = —1/%n Loe) = (n!)" T(n + Qu es 1) ° . (28) 


Because x = (207z,)%, we notice that (x(r)) depends on the initial 
velocity x. As r > o, (x(r)) in equation 28 approaches its stationary 
value (x), which is given by 


g,, = Oe + GP, as 


(x).¢ 1s independent of the initial condition x,. This stationary mean 
velocity, (x), can also be found by using the stationary density 
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pet(x) as follows: 


(t)at = [ : psx) dx 


[ ° LX (x) dx 


i (2)4o us Qut (1/2) —* 7 Qn 
= TOe ED “ee (2)°** OM e"* E22) dz 
(2)'T[2u + (3/2)] | 

T(2u + 1) 


The variation of (z(r)) is illustrated in Fig. 1. It has a maximum value 
x, at rt = O and decreases exponentially to the stationary value (x)., 
as given in equation 29. The nonstationary mean square value of x(t), 
given its initial value 23, is difficult to evaluate explicitly, but its 
stationary value (z’),, can be found by integrating S,(w) in equation 
25 over the entire frequency range from — » to + ~. By this procedure, 


» _ f* _ 2 Pu + (8/2) ‘In — (1/2)] 
fu =f Sle) de = oe ate et 
(30) 


As expected, (x),¢ is also independent of the initial condition v,. The 
variations of (),¢ with the nonlinear coefficient k = y/zs, are shown 
in Fig. 2. 


€X> st 


MEAN VELOCITY <2(T)> —> 


TIME, T —> 


; Fig. 1— Nonstationary mean velocity of nonlinear system subjected to random 
orce. 
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Fig. 2—Comparison of mean square velocity nonlinear system subjected to 
random noise. 


III, EQUIVALENT LINEARIZATION TECHNIQUE 


The eigenfunction expansion of the degenerate ordinary differential 
equation of the governing Fokker-Planck equation is often difficult 
to find. In such cases it is sometimes convenient to use the perturba- 
tion®’’ or the equivalent linearization®’® techniques to obtain desired 
quantitative results. If the nonlinearity of the system is small (that is, 
v/B << 1), these techniques provide the simplest means for obtaining 
approximate results. In the following, an equivalent linearization pro- 
cedure is used to derive the power spectral densities of the nonlinear 
velocity x(t) in equations 1 through 4. Let 


+ Boxe + ex). = f® (31) 


be the equation of motion of a system equivalent to that described by 
equations 1 and 4 in which £, is the equivalent linear stiffness and 


ea) = Br +% — Bix (32) 


is the error function. If «(x) is small and may be ignored, equation 
31 becomes a linear differential equation, and its spectrum can be 
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solved by standard technique. The equivalent linear stiffness B, can 
be determined by the criterion that the mean square error is mini- 
mized. The mean square error is 


(e(x)”) = [ | — B,)°2? + a + 2(6 — 8. foes dz. 
Setting d(e(x)?)/08, = 0 we obtain 


[ Bx"p(a) dx = [ B.x"p(x) dx — [ yp(x) dx; 
therefore 
g [ x(x) dx + 


i [ 7 x'p(x) dx 


= md Stee 
The following two cases are considered: 
(t) Assuming p(x) is gaussian, that is, 


_ (2n’s,\7* (-8) 
ple) = (7£%) ° exp (8), 


(1?\q = 02 = Te. (34) 








then 





Substitution into equation 33 yields 
B..@ = BAL + ¥/m8,). (35) 


(1) Using true (stationary) distribution, p,;(x) given by equations 
8 and 9 becomes: 


pi ot exp [Ge Pes.) (36) 


e) 2 
= (y/m 80) — Bx ) 
A [ z exp ( One, dx 


1 Grrr is fea |, 
2 B 2 


where 
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Therefore the mean square value of x(t), using true distribution 
equation 36, is 


(x) 9 


/ xp ,(x) dx 
0 


te (gyre exp (-3& 2’) as | 


(xs, + 7). (37) 





Dil bs | bo 


Substitution into equation 33 yields 
2 
os Gh = ene, Soe, Shade 
pr =A(1+—~) = a1 + 2-4.) Ge 
Comparison of equation 38 with equation 35 indicates that B,., g is 
the first-order approximation of Be, 7. 
Now let us consider the simple linear system 


é+ Bx = fo (39) 


whose transfer function is given as 








H(iw) = (40) 


oe nee 
B. + tw 

According to the familiar linear theory of random processes, the 
power spectrum of x(t) in equation 39 is given by 





S,(@) a B nar, (41) 


where s, is defined in equation 3. 
Substituting B., g as given in equation 35 into equation 41, we obtain 


S.,¢) = (42) 


8, 
BL + (y/ms.))? +o” 


Substituting @,, r as given in equation 38 into equation 41, we obtain 


S.,7(w) = (43) 


So 
[B(rs, + 2y)/(as, + y)]}° + 0” 
By setting y = 0, both equations 42 and 48 give 


So 


8° + 





S.@) = 
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which is the spectrum of the corresponding linear system. The mean 
square value and the power spectral density of the velocity response 
x(t), obtained by solving the Fokker-Planck equation, and the equiv- 
alent linear equations using gaussian and true distributions, respec- 
tively, are summarized in Table I. In this table k = y/zs, is the non- 
linear coefficient. 

The mean square velocities (x”), (x”)¢, and (x”)p are compared in 
Fig. 2. It is seen that for k < 0.035, that is, in a very small nonlinearity 
range, both linearization cases give higher mean square velocities than 
the exact solutions. For k > 0.035, equivalent linearization methods 
give larger results when using true distribution and smaller results when 
using gaussian distribution than the exact solutions. 

The power spectral density functions S,,¢(w) and S,,¢(w) obtained 
by the equivalent linearization procedure, using gaussian distribution 
and true distribution of z(t), respectively, are compared in Fig. 3 in 
which B, = (6°/s,)S.,¢(w) and B, = (8"/s,)S,,7r(w). Notice that both 
S,,¢(w) and S,,7(w) are monotonically decreasing functions, and the 
differences between them are negligible for small nonlinearity. 

The exact power spectral densities are compared with the equivalent 
linearized solutions in Fig. 4. For the exact solution of S,(w) as given 
by equation 25, the spike at w» = 0 is evaluated by normalizing S,(w) 
to an area equal to that given by Fig. 2, that is, 


() = i S.() dw = 3.053 for k = 0.01. 
0 


Curves shown in Fig. 4 are seen to be monotonically decreasing. The 
equivalent linearized systems have higher power spectra of x(£) in the 
low-frequency region and lower power spectra of x(t) in the high- 
frequency region than the actual nonlinear system has. 


IV. CONCLUSION 


It has been shown that the exact expression for the two-dimensional 
nonstationary probability distribution of a class of simple nonlinear 
systems can be found in terms of the spatial eigenfunction expansion 
of the governing Fokker—Planck equation. Equivalent linearization 
techniques can be very useful in generating approximate response 
statistics for certain systems having small nonlinearities. In Figs. 2, 
3, and 4 good agreement has been achieved in the comparison of the 
exact and approximate mean square values and of the power spectral 
densities of the nonlinear random response. 


Exact 
solution 


Equivalent 
linearization 
using 
gaussian 
density 


Equivalent 
linearization 
using true 
density 


*k= y/[T8o: 


TABLE [—CoMPARISON OF RESPONSE STATISTICS OF THE VELOCITY RESPONSE x(t) 


a(1 + k)* 


Mean square velocity, £ (x?) Power spectral density of velocity, & Sz(w) 
1 72u + (3/2)] © M[n — (1/2)] T[2n + (3/2)] 278 95(w) 
2 (Qh +1) no (n!)3T(n + 2n + 1) T(2u + 1) T(2u + 1) 
4: Bs > 4n T[n — (1/2)] 
Am am (n!)3T(n + Qu + 1) [4n? + (w/B8)?] | > 


1 
(1 + k)? + (w/8)? 


1 
[(1 + 2k)/(1 + k)}? + (w/8)? 


ALIDOTAA NOILOW WOGNVa 


6926 
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Fig, 3 — Power spectral density by equivalent linearization. 
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Fig. 4 —— Comparison of the exact and approximate power spectra. 
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APPENDIX 


Derwation of Equation 23 


The following formulae are used in the derivation of equation 23: 


i; & "PLE ai 


0 


_T@+)T@+n+D si, 





(<n, +4; +454) 


ni! Tle + 1) 
(Re 8B > —1, Res > 0) (44) 
where 
F(a, Bs 73 2) = oF 1 (o, Bs 3 2) (45) 
is a generalized hypergeometric series which is defined as 


pl (on 9&2 yt? Ay By » Be noe Ba 2) i Fray! (B,)2(Bo)x oon (B,)x k! 


in which 
_Tet+hs 
ee = Te) 
and 
(@), = MES. 4s) 


A special case for equation 45 is when z = 1: 


Tyla — a — 8) 
ky = aly — 8) 
(Rey > Re (a+ 8), Rey > Res > 0). (47) 


Using equations 44 through 47, the integral involved in equation 
22 can be evaluated as follows: 


Fa, B3y;1) = 


°° 
/ peer EN) dz 
0 


_ FR (/2) + Cn +n + 1) 


n! T(2u + 1) F(—n, 2u + 3; 24 + 1; 1) 
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_ Fw + B/2))TQn +n + 1 TQe+ Ye + 1+n — 2u — (8/2)) 


n! TD(2u + 1) V2Qu+1+n)r[2u+ 1 — 2 — (3/2)] 
_ TQu + 8/2)1t[n — (1/2)] _ —Thu + (8/2)] Tin — (1/2) 
n! T[—(1/2)] 2(m)? n! 


which is equation 23. 
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Statistical Analysis and Stochastic 
Simulation of Ground- 
Motion Data 


By S. C. LIU 
(Manuscript received July 31, 1968) 


The time variations of the root-mean-square accelerations, the auto- 
correlation functions, and the power spectral density functions of 12 strong- 
motion earthquake accelerograms are analyzed. The results indicate that: 
(t) strong-motion accelerograms of sufficiently long duration are stationary 
in the rms sense, (it) the stationary rms acceleration ts a good measurement 
of earthquake intensity, and (tit) the autocorrelation and power spectral 
density functions of strong-motion accelerograms resemble those of a nar- 
rowband process. Based upon these results, a method of determining the 
transfer characteristics of a site is introduced. A procedure for generating 
a filtered, gaussian stationary process to simulate ground motions ts de- 
veloped, and two applications of this simulation procedure illustrate its 
significance. 


I, INTRODUCTION 


For many years structural engineers have been concerned with the 
dynamic response of structural systems subjected to seismic excita- 
tions. Ground motions may be caused by natural earthquakes, by 
underground explosions, or by nuclear air blasts. Structures such as 
high-rise buildings, nuclear reactor facilities, or sensitive equipment 
in the vicinity of such events are vulnerable to induced random-type 
disturbances, Traditionally, deterministic methods of analysis relying 
on the known earthquake response spectra have been used.t These 
methods have provided valuable information regarding the behavior 
of structures during earthquakes. However, this procedure has a 
serious restriction in that only a few strong-motion accelerograms 
exist which provide ground-motion input. An earthquake is usually 
initiated by a series of irregular slippages along faults, followed by 
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innumerable random reflections, refractions, dispersions, and attenua- 
tions of the seismic waves within the complex ground formations 
through which they travel. 

Since ground motions are generally random, a probabilistic method 
of analysis appears to be more appropriate than the traditional 
method of establishing a reliable design basis for structures sub- 
jected to ground motions. The simulation of ground motion is un- 
doubtedly a necessary step in performing such a probabilistic analysis. 
Because earthquakes are unpredictable, some researchers in structural 
and earthquake engineering have attempted in recent years to simu- 
late earthquakes by using stochastic processes. Both stationary and 
nonstationary models have been investigated.?-* 

It is the purpose of this paper to investigate the best characteriza- 
tion of ground motions and to establish a valid basis for the stochastic 
simulation of seismic records. For this purpose, 12 commonly used 
strong-motion earthquake accelerograms are analyzed. The time varia- 
tions of the rms accelerations, the autocorrelation functions, and the 
power spectral density functions of these accelerograms are inves- 
tigated. The stationary rms accelerations are used as a measure of 
earthquake intensities and are compared with those found by Housner.® 
From the power spectral density analysis of existing ground-motion 
records, a linear filter can be determined to represent the transfer 
characteristics of the ground layers at a particular site. This filter 
is used in developing a method of generating a gaussian stationary 
process to simulate ground-motion accelerations. 

In Section III the generation of synthetic ground acceleration 
records using a digital computer is discussed. The synthetic records 
are generated from existing records and from estimated response 
spectra. Two practical examples, of importance to structural engineer- 
ing, are illustrated in Section IV. 


II. STATISTICAL ANALYSIS OF GROUND-MOTION DATA 


In general, no individual record is representative of any other record 
in an ensemble. However, if the data are stationary, valuable statistics 
may be derived by averaging the existing records. If an ergodic process 
is considered, a single record will be sufficient to represent the entire 
process. 

The mean value of a given time-history record x(t) of duration 
T is defined by 


i) = Bf a0 at. (1) 
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The mean square value of x(¢) is defined by 


rn = 4 f 220) dt. (2) 


Following this definition, the root-mean-square or the rms value 
of x(t) is the positive square root of the mean square value (z?),,. This 
is given by 


rms of x(t) = E [ x(t) a} (3) 


If x(t) is a stationary random process with zero mean, its autocor- 
relation function R,(r) and power spectral density function S,(w) 
are given by the following transform pair: 


ree lim : "OTE ASAE (4) 


SG)= - i ; R.(ae** dr. (5) 


Notice that R,(r) is always a real-value even function with a 
maximum at + = 0 and that, if (x),, = 0, 


(0) =f Sale) do = Timm (2%) 6) 


that is, the maximum autocorrelation represents the mean square 
value of a stationary random process. 

These simple statistical quantities defined in equations 1 through 
5 are useful in applying random vibration theory in earthquake 
engineering. The mean value, mean square value, and root-mean- 
square value represent the time-average strength of the input func- 
tion. The timt variations of (z)ay, (v”)ay, or rms of x(t) can be used 
to test the restricted sense stationarity of a time series. The auto- 
correlation function and the power spectral density function are closely 
related to the second-order properties and are generally used as char- 
acterization functions of a stationary process. Since strong-motion 
earthquake accelerograms are, in general, gaussian,® either the auto- 
correlation functions or the power spectral density functions will be 
sufficient to provide a complete statistical description. These two func- 
tions also provide a mathematical basis for the random response anal- 
ysis of linear structural systems. 

Tf x(t) is given in digitized form, equations 1 through 5 can be 
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written respectively as follows: 


1 N 
it) = N dX Ley (7) 
eet Yat (8) 
Vhew NO k 3 
fej =)= 9 
rms of x(t) iat} ' (9) 
R, = B.{k At) = ae k=0,1,2,---,m, (10) 


and 


s. = 8.6) = “|r, Le yo R: cos 2 + (— yr. |, 


7=1 
k= 0,1,2, +++ ,m, (11) 


in which N = T'/At equals the digitization time interval At, x, = 
z(kAt), and m represents the maximum lag number. 


2.1 Six Strong-Motion Earthquakes 


Using equations 7 through 11, six commonly used strong-motion 
earthquakes each with two horizontal components are analyzed. They 
are: 


A. El] Centro, California, December 30, 1934, N-S. 

B. The same, but E-W. 

C. El Centro, California, May 18, 1940, N-S. 

D. The same, but E-W. 

Kk. Olympia, Washington, April 13, 1949, S10E. 

F. The same, but S8OW. 

G. Taft, California, July 21, 1952, S21W. 

H. The same, but N69W. 

I. Golden Gate Park, San Francisco, March 22, 1957, N10E. 
J. The same, but S80H. 

K. Alameda Park, Mexico City, May 11, 1962, N10°46’W. 
L. The same, but N79°14’E. 


The accelerograms for these earthquakes are presented in Fig. 1. 
Upon first inspecting these existing strong-motion accelerograms, one 
might conclude that they are nonstationary. However, this conclusion 
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Fig. 1— Strong motion earthquake accelerograms. El Centro, Calif., Dec. 30, 
1934, (a) N-S, (b) E-W; El Centro, Calif., May 18, 1940, (c) N-S, (d) E-W; 
Olympia, Wash., Apr. 18, 1949, (e) S10E, (f) S80W; Taft, Calif., July 21, 1952, 
(g) S21W, (h) N69W; Golden Gate Park, San Francisco, Calif., March 22, 1957, 
. aaa a) S80E; Alameda Park, Mexico City, May 11, 1962, (k) N10°46’W, 
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is debatable, considering the lack of sufficient ground-motion data 
for statistical studies, the difficulties of establishing valid nonsta- 
tionary characteristics, and the ultimate objectives in developing a 
useful stochastic model for earthquake-induced ground accelerations. 

The time variations of the rms amplitude of these strong-motion 
accelerograms are obtained and presented in Fig. 2, from which it is 
seen that the earthquake rms acceleration in general approaches a 
stationary value as the duration of the accelerogram is increased. This 
phenomenon indicates that strong-motion accelerograms are sta- 
tionary in their rms amplitude at long duration (JT > 20 seconds, ap- 
proximately). 

The longest and shortest periods existing in a digitized time-history 
record are 27 and 2At, respectively. These two extremes constitute an 
effective period range for the record. Any analysis of the record be- 
yond its effective period range will be of no significance. A At of 0.01 
second was used in this study. 

The stationary rms acceleration can be used to measure the inten- 
sity of an earthquake if it is accompanied by the corresponding effec- 
tive period range. The rms intensities for all earthquakes, assuming 
T = 20 seconds (corresponding to a period range of 0 to 40 seconds), 
are listed in Table I. This table shows that the N-S component of 
the El Centro 1940 earthquake (C) has the strongest accelerogram. 

It is interesting to compare earthquake rms intensities with the 
traditionally used Housner intensity,® which is defined as 


2.5 


Sh = S.0d, T,)-aT, , (12) 


0.1 


where S, is the solution of the following equations: 





a+ “w+ Sue —#,(t) (13) 
ie he : 
S,(A, 7.) = max | % | (14) 


for constant coefficients A and T,. 

Physically, equation 13 represents the equation of motion of a basic 
single-degree-of-freedom linear system with mass m, viscous damping 
i, and stiffness k subjected to earthquake excitation %,(¢) at the sup- 
port, as shown in Fig. 3. Correspondingly, T, = 27 (m/k)™* is the 
natural period of the system, and S, in equation 14 is the maximum 
relative velocity reached by this system during the earthquake excita- 
tion. Therefore, Housner’s intensity definition actually represents an 
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earthquake’s potential peak velocity of structures in the period range 
0.1 to 2.5 seconds. In Table II the rms (normalized to a factor of 
2.7/2.20) and Housner’s intensities are compared and good agree- 
ments are observed. The basic difference between these two intensity 
definitions is that the rms intensity is independent of the transfer 
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Fig. 2— Root-mean-square acceleration of strong-motion earthquakes. (a) 
El Centro and Golden Gate, (b) Olympia and Taft, (c) Alameda Park. 
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Fig. 3 — Linear mechanical system subjected to ground excitation. 


characteristics of the structure while Housner’s intensity depends on 
both the input and the transfer characteristics of the structure. It is 
much simpler to calculate the rms value for a given input time- history 
function than to find the SJ, values, which require numerous mathe- 
matical integrations on the computer. However, the rms intensity 
should be used with care; one should consider the associated effective 
period range and the predominant period contained in the waveform. 


2.2 Determination of the Predominant Period 


The autocorrelation and power spectral density for all accelerograms 
considered are found on a digital computer by using a Fortran program 


TaspLhE I—RMS Inrensities or Srronc-MoTion EARTHQUAKES 








RMS acceleration Average 
at T = 20s rms 
Case Identification (ft/s?) (ft/s?) 
A EC34N8 1.4 1.4 
B EC34EW 1.4 
C EC40NS 2.4 2.20 
D EC40EW 2.0 
E OL49S10E 1.7 1.85 
F OL49880W 2.0 
G TA52821W 1.4 1.4 
H TA52N69W 1.4 
I GG57880E 0.8* 0.7* 
J GG57N10E 0.6* 
K AL62N10°46’E 0.75 0.725 
L AL62N70°14’E 0.7 


* Taken at T = 10.0s. 
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based on equations 10 and 11. Those functions, R(r) and S(w), for the 
FE] Centro 1940 earthquake are shown in Figs. 4 and 5 for the N-S 
and E-W components, respectively. The smooth curves in the power 
spectral density diagrams (Figs. 4b and 5b) were obtained by introducing 
a Hanning or smoothing procedure” to the raw estimates given by 
equation 11. The autocorrelation functions (Figs. 4a and 5a) all have 
a maximum at 7 = 0 and diminish rapidly at large correlation time. 
The ordinates of the power spectral density functions (Figs. 4b and 5b) 
generally increase with increasing frequencies to a maximum value at 
some frequency which may be considered a predominant or character- 
istic ground frequency, and then decrease rather rapidly toward zero 
in an asymptotic manner. Also of interest is the fact that this general 
rise and fall of the power spectral density function is accompanied by 
local random fluctuations. 

The above results indicate that the autocorrelation and power spec- 
tral density of strong-motion earthquake accelerograms resemble those 
of a narrowband process. This implies that earthquake acceleration 
may be simulated by passing a wideband process through a linear 
filter which reflects the local geological conditions. 


III. STOCHASTIC. SIMULATION OF GROUND-MOTION ACCELEROGRAMS 


3.1 Basic Requirements for Ground-Motion Model 


There are many occasions when a ground-motion model is required. 
Examples are the prediction of ground motion at a certain site where 
no past records are available, and statistical analyses of structural 
responses based upon very limited actual ground-motion records. The 
hypothesized models must, of course, possess the pertinent characteris- 
tics of real ground motions and must be supported by existing data. 
More importantly, these models must properly reflect the damage (or 
response) potential of future ground motions to a wide range of struc- 
tures, 


TaBLE IJ — CoMPARISON OF EARTHQUAKE INTENSITIES 


Earthquake 
case 





A&B C&D E&F G &{H L&J K&L 
Normalized 
rms intensity 17 2.7 2.2 1.7 0.84 0.9 
Housner 


intensity 1.9 2.7 1.9 1.6 unavailable 
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Fig. 4—N-S component of earthquake at El Centro, Calif., May 18, 1940. 
(a) Autocorrelation function. (b) Power spectral density function. (c) Compari- 
son of actual and idealized power spectral density functions. 
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Fig. 5—E-W component of earthquake, El Centro, Calif, May 18, 1940. 
(a) Autocorrelation function. (b) Power spectral density function. (c) Compari- 
son of actual and idealized power spectral density functions, 
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The random phenomenon observed in earthquake ground motions 
suggests that one can reasonably assume that the resultant seismic 
wave arriving at the surface of the ground will contain a random as- 
semblage of velocity impulses having gaussian distribution. Further- 
more, the analyses of the earthquake accelerograms in the previous 
sections show that they have characteristics resembling those of a 
narrowband process. It is therefore postulated that the use of stochas- 
tic processes would be appropriate in modeling earthquake ground 
motions. 

Basic criteria for establishing such a stochastic model can be spe- 
cifically stated as follows: 

(1) The model must have the basic properties reflected by the past 
recorded data such as the intensity, duration, general physical ap- 
pearance, and all important characteristics resulting from local geo- 
logical conditions. 

(1) The response statistics of the stochastic model must be equi- 
valent to those produced by the real ground motion or to those 
predicated on strong theoretical or empirical bases. 


Some simplifying assumptions required in the development of our 
stochastic model for earthquakes are: 


(z) The input seismic wave transmited at bedrock by an earth- 
quake is represented by stationary, white noise. 

(wi) The ground layers of a seismic station during the shock are 
represented by a single-degree-of-freedom system with linear be- 
havior. (A more sophisticated ground-layer filter may also be used.) 

(111) Local random fluctuations appearing in the power spectral den- 
sity function of the real earthquake accelerogram are neglected when 
modeling the transfer characteristics of the site. 

The representation of ground layers by a single-degree-of-freedom 
system is shown in Fig. 6. Such a simple system is characterized by its 
transfer function h(r) in the time domain or H(iw) in the frequency 
domain, with h(r) and H(iw) known as the unit impulse response and 
the complex frequency response of a linear system, respectively. 

If the simple mechanical system shown in Fig. 3 is used as the linear 
filter, the power spectral density of the total acceleration of the mass, 
when the acceleration at the support is taken as the input, can be 
found in terms of the corresponding transfer function. 


Si[l + 4d¢(@/e,)"] 


Sie) = THO) TS = TH Gofal + A(o/o,) 


(15) 
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Fig. 6 — Ground layers represented by a linear system. 


where S; is the constant power spectral density of the input acceleration 
by assumption (2); A, and w, are the ground characteristic damping and 
frequency, respectively. 

For known S;, \,, and w,, equation 15 gives a smooth curve for 
S,(w). The location of the peak amplitude of S,(w) is determined by 
w,. The shape or the rate of the rise and fall of S,(w) is governed by 
A, , and the relative amplitude of S,(w) depends on S; . For any power 
spectral density derived from a real ground-motion record, an idealized, 
smooth equivalent power spectral density can be found by using equa- 
tion 15 and by properly adjusting the three-characteristic constant 
\,, #,, and S; in this equation. The idealized power spectral density 
function for both horizontal components of the El Centro, California 
1940 earthquake were found by the prescribed procedure. As shown in 
Figs. 4c and 5c for both cases, the idealized power spectral density 
function covers the same area as the actual power spectral curve in the 
frequency range 0 through 10 cps. The w, , \,, and S; values used for 
the N-S and E-W components of the El Centro 1940 earthquake are 
15.5 rad/s, 0.42, 0.046 ft”/s*/rps, and 14.7 rads/s, 0.41, 0.033 ft”/s*/rps, 
respectively. 

Once the representative ground-layer filter for a given ground motion 
is determined, the artificial earthquake can be generated by passing 
white noise through this filter and measuring the output time history. 


3.2 Generation of a Gaussian Stationary Process 


This procedure starts by sampling a sequence of pairs of statistically 
independent random numbers 21, 22; %3, 043 *'* 3 Ln-1, Ly all of which 
have a uniform probability distribution over the range0 << x < 1. A 
new sequence of pairs of statistically independent random numbers 
V1, Y23 U3) Y43 °°" 5 Yn-ty Yn are then generated using the relations 
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¥: = (—2 log, a,)? COS 2rXi41 PET 3a (16) 
Yier = (—2 log, x,)? sin 2rx,4, 

which have been shown to have a gaussian distribution with a mean 

of zero and a variance of unity.1!-8 

A single waveform y(t) can now be established by assigning the 
values ¥1, Yo... Yn to n successive ordinates spaced at equal intervals 
At along a time abscissa and by assuming a linear variation of ordi- 
nates over each interval. To define a time origin, assume that the 
initial ordinate y,, which is taken equal to zero, is located at t = t, 
where ¢, is a random variable having a uniform probability density 
function of intensity 1/At over the interval 0 < tf, < Al. 

For practical reasons At must, of course, be taken as finite; how- 
ever, its value should be set sufficiently small so that the true power 
spectral density function is reasonably constant at intensity S; over 
the lower range of frequencies which are to be properly represented in 
the process. A value of 0.025 second or smaller is recommended. 

To establish the desired stationary process a(t), each member y,(t) 
(r = 1, 2, --- , N) of the normalized process y(£) must be filtered in 
accordance with equation 15. This step can be accomplished by assum- 
ing that a simple single-degree-of-freedom system having an undamped 
circular frequency w, and a damping ratio \, is subjected separately to 
the N support accelerations y,(£) and then by calculating the correspond- 
ing absolute or total acceleration functions a,(é) of the mass. In mathe- 
matical form this statement is equivalent to saying that one must solve 
the differential equations 


Z,@) + 20, 2,() + 03Z,) = -yQ r=1,2,---,N (17) 


for the functions Z,(t) and then evaluate the desired family of accelera- 
tion functions a,(¢) using the relation 


a()=Z.)+y 1r=1,2,---,N. (18) 


Equation 17 can be solved numerically on a digital computer using 
the standard linear acceleration method. Based on the assumption that 
the input earthquake duration is divided into the very short equal 
time intervals At, and the response acceleration varies linearly over 
each interval, this method gives the following simple relations be- 
tween the response displacement Z, and its derivatives at step n and 
n+1: 
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@). Mya (Zonas as 


(Zona ms (Za + At ae At 


(Z i) Gone 


(Zener = (Zen + (Zr)n At + wg Ate At’. (19) 


IV. ILLUSTRATIONS OF GENERATION OF ARTIFICIAL ACCELERATION 


4.1 By Known Power Spectral Density 


As the first example, artificial earthquakes are generated using the 
prescribed approach, to simulate the average of U. S. strong-motion 
earthquakes observed at seismic stations having a firm soil founda- 
tion. Values of 15.6 rad/s for w, and 0.6 for A,, representative of such 
soil conditions, were used. A total of 50 artificial accelerograms (N 
= 50) were generated for process a(t) with a duration of 30 seconds 
which corresponds to At = 0.025 second and n = 1200. The intensity 
S; of the unfiltered “white noise” was set at 0.00614 ft?/s? so that the 
mean velocity response spectrum curves for the filtered process a(t) 
would give a “best fit” with the standard response spectrum curves 
published by G. Housner.® This intensity is slightly less than the 
value of 0.0063 ft?/s* used by J. Penzien™‘ in a previous investigation 
to correlate the mean velocity response spectrum curves for “white 
noise” with Housner’s standard curves. 

A typical sample member of artificial accelerograms generated is 
shown in Fig. 7. It is interesting that this accelerogram appears to be 
very similar to the real accelerograms in Fig. 1 except for the general 
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Fig. 7—-Sample member of artificial accelerogram, 
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Fig. 8— Autocorrelation function of the sample member of artificial accelero- 
gram. 


stationary apparance of the artificial accelerograms versus the non- 
stationary appearance of the real accelerograms. 

The autocorrelation and corresponding power spectral density func- 
tion for this sample artificial earthquake are shown in Figs. 8 and 9, 
respectively. It is of particular significance that, while these autocor- 
relations and power spectral densities are similar to those for the real 
earthquake (Figs. 4 and 5), local random fluctuations also appear in 
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Fig. 9— Power spectral density function of the sample member of artificial 
accelerogram, 
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Fig. 10 — Comparison of velocity spectra of earthquakes, \ = 0.02. 


the individual power spectral density function. It is expected that 
these local fluctuations and the variations from one individual power 
spectral density function to another will be eliminated by the averag- 
ing procedure. It has been shown that the average power spectral den- 
sity function of artificial earthquakes is quite close to the prescribed 
function.25 

The average response spectra (A = 2%) of 50 artificial earthquakes 
are compared with the real spectra given by Housner® in Fig. 10. Good 
agreements are observed over the significant period range. The con- 
fidence limits for these average spectra are given in Fig. 11 by extend- 
ing 3c (standard deviation) of S, above and below the mean. Since 
the distribution of the response spectra is no longer gaussian, these 
limits correspond to an approximate 89 percent confidence band 
according to Chebyshev inequality.*® 1” 

The above results (Figs. 7 through 11) show that the basic require- 
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Fig. 11— Variations and confidence limits for average velocity spectra of 
artificial earthquakes, \ = 0.02. 
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ments for ground-motion simulation have been satisfied. It takes ap- 
proximately 10 seconds to generate one sample function on a CDC 
6600 digital computer. These facts indicate that a good, reasonably 
economical ground-motion simulation has been achieved. 


4.2 By Known Response Spectrum 

In the second example, we have a situation somewhat different from 
the first. Engineers are asked to generate representative ground-mo- 
tion accelerations from an estimated pseudovelocity spectra as given 
in Fig. 12. There is no past recorded ground-motion data available. 
The generated artificial earthquakes will be used to evaluate the prob- 
able damage of structures resulting from possible seismic events tak- 
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Fig. 12 — Pseudovelocity response spectra, \ = 0.02. 
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ing place in the vicinity of a proposed site. The estimated response 
spectra were obtained from a careful examination of earthquake ac- 
tivities in a certain seismic area and from an analogous prediction 
using records taken at stations having geological conditions similar or 
comparable to those of the given site. If the event refers to an under- 
ground explosion, the estimated response spectra might be predicted 
from past events under the same physical conditions (yield level and 
depth of the explosion, epicenter distance, and so on) and similar 
geological environments. 


The given pseudoresponse spectrum curves (Fig. 12) show peaks at 
period 7, & 1.25 seconds if the long-period portion (7, > 2.5 seconds) 
or the low-frequency portion (w, < 2.5 rad/s) is neglected. The cor- 
responding frequency value where the peak spectrum amplitudes occur 
is approximately 5 rad/s. This value can be used as an approximate 
predominant ground frequency w, . The proposed site has a firm soil 
foundation, therefore a value of 0.6 for the characteristic ground damp- 
ing \, is used. 

Using these two characteristic values and letting S; be unity in 
equation 15, the same procedure as used in the first example can be 
followed again. Five typical artificial accelerograms of 30-second 
duration are generated and shown in Fig. 13. The individual response 
spectra and the mean response spectra are obtained (Fig. 14). The 
general shape of the pseudoresponse spectra using S; = 1.0 is similar to 
the estimated spectra. By matching the area covered by the estimated 
and pseudospectra curves, the amplitudes of the pseudospectra are 
normalized to have the same order of magnitude as the estimated 
spectra. The accelerations shown in Fig. 13 which have been modified 
by these same normalization procedures, represent the final form of 
the desired artificial accelerograms. 


The comparisons of individual and mean pseudoresponse spectra 
with the given response spectra (lig. 14) are satisfactory except for 
the high-period range (7, > 2.5 seconds). If only those general building 
structures are considered which have a natural period range of 0.3 to 
2.5 seconds, the large discrepancy of response spectra in the high- 
period range is not significant. The mean pseudoresponse spectrum 
(Tig. 14f) gives better results, as is expected. Overall results of the 
simulation may be improved with more accurate determination of values 
for w,, 4,, and the larger sample size N of the simulating process. 
Methods for such improvements are presented in a separate paper.’® 
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Fig. 18 — Pseudoearthquake accelerograms. 
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V. CONCLUSIONS 


From the results presented in this investigation, the following con- 
clusions may be deduced: 


(t) In general, strong-motion earthquake accelerograms of suf- 
ficiently long duration (T’ > 20 seconds, approximately) are station- 
ary in the rms sense. 

(2) The rms acceleration of an earthquake, along with the effec- 
tive period range and the predominant period of the associated ac- 
celerogram, can be used to determine the intensity of the earthquake. 

(2) The autocorrelation and the power spectral density of a strong- 
motion earthquake resemble those of a narrowband process. 

(iv) The power spectral density analysis of existing earthquake 
records suggests that the transfer characteristics of the given site can 
be represented by a simple, linear system. 

(v) A filtered, gaussian, stationary process generated on a digital 
computer proves to be successful in modeling ground motions in- 
duced by earthquakes or by nuclear underground explosions. 
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Reliable Information Storage in 
Memories Designed from 
Unreliable Components* 


By MICHAEL G. TAYLOR 
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This 1s the first of two papers which consider the theoretical capabilities 
of computing systems designed from unreliable components. This paper 
discusses the capabilities of memories; the second paper discusses the 
capabilities of entire computing systems. Both present existence theorems 
analogous to the existence theorems of information theory. The fundamental 
result of information theory is that communication channels have a capacity, 
C’, such that for all information rates less than C, arbitrarily reliable 
communication can be achieved. In analogy with this result, it is shown that 
each type of memory has an information storage capacity, C, such that for 
all memory redundancies greater than 1/€ arbitrarily reliable information 
storage can be achieved. Since memory components malfunction in many 
different ways, two representative models for component malfunctions are 
considered. The first is based on the assumption that malfunctions of a 
particular component are statistically independent from one use to another. 
The second is based on the assumption that components fail permanently 
but that bad components are periodically replaced with good ones. In both 
cases, malfunctions in different components are assumed to be independent. 
For both models it ts shown that there exist memories, constructed entirely 
from unreliable components of the assumed type, which have nonzero 
information storage capacities. 


I. INTRODUCTION 


The problem of designing systems which operate reliably even 
though their components are unreliable has been formulated in many 
different ways. In a typical formulation, one considers some particular 


*This work, which is based on part of a doctoral thesis submitted to the 
Department of Electrical Engineering, M.IT., September 1966, was supported by 
the National Aeronautics and Space Administration (Grant NsG-334). 
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system which performs a computation with a nonzero probability of 
error. The problem is to design some other “reliable” system which 
performs the same computation using the same types of components 
but with a smaller probability of error. In fact, the ultimate objec- 
tive is to show that it is possible to design systems, using only 
unreliable components, which perform computations with an arbi- 
trarily small probability of error. Unfortunately, there is no standard 
terminology for describing these systems; therefore, the following 
section introduces the terminology to be used throughout this paper. 


1.1 Definitions 

The computations performed by the computing systems are de- 
scribed in terms of elementary operations where an elementary opera- 
tion is any Boolean function of two binary operands. There are 
sixteen different elementary operations, each one of which can be 
represented by a binary matrix of the type shown in Fig. 1. Typical 
elementary operations are AND, or, and modulo-2 addition. The com- 
puting systems to be considered are constructed from components 
which are devices that either perform one elementary operation or 
store one binary digit. The complexity of a system is defined to be 
equal to the number of components within the system. 

In an irredundant computing system, the amount of computation 
performed by the system equals the number of elementary operations 
which are executed. Corresponding to each irredundant computing 


AND 0 1 
0 0 0 
1 0 1 
Fig. 1— Binary matrix for AND operation. There are 24 = 16 ways of filling 


this table, each one of which describes one of the 16 allowed Boolean functions. 
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system, there are many redundant computing systems which perform 
equivalent computations. These redundant systems are more complex 
than the equivalent irredundant one but, hopefully, they are also 
more reliable. The amount of computation performed by any one of 
these redundant computing systems is defined to be equal to the 
amount of computation performed by the corresponding irredundant 
system. Finally, the redundancy of a computing system equals the 
ratio of the complexity of the system to the amount of computation 
performed by the system. 

To illustrate the use of these terms, consider a system that com- 
putes, in parallel, the modulo-2 sums of the digits in two k-digit se- 
quences. The system first encodes each sequence of digits into a code 
word from an (n, k)* group code, then forms the modulo-2 sums 
of the digits in these code words, and finally decodes the result. The 
amount of computation equals k since an equivalent irredundant 
computer would simply perform k elementary operations each consist- 
ing of a module-2 sum. If the complexities of the encoder and decoder 
within the redundant system are Cy and Cp, respectively, the com- 
plexity of the entire system equals Cz + Cp + n, where the last term 
arises from the n modulo-2 adders required to perform the desired 
operation. The redundancy of this system equals (Cg + Cp + n)/k. 


1.2 Historical Background 

Von Neumann was one of the first to propose a system which uses 
redundancy to gain reliability.1 He considered systems consisting of 
interconnections of identical elements} where all the elements compute 
either the majority function or the Sheffer-stroke function. The form 
(network topology) of the redundancy network is similar to that of 
the original irredundant network, the precursor. Specifically, each 
element in the precursor is replaced by a set of 3n elements of the 
same type in the redundant network (redundancy = 3n), and each 
interconnection is replaced by a bundle of n interconnections. The 3n 
elements in each set are interconnected in such a way that there are 
n outputs. It is assumed that a malfunction occurs in a particular set 
of elements whenever more than a certain fraction, 6, of the n outputs 
are in error; 6 is chosen to minimize the probability of a malfunction 
within the entire system. Von Neumann showed that, for large n, the 


*mn is the length of each code word; k is the number of information digits in 
each word. 

+ The terms “element” and “network element” are used to indicate devices 
consisting of several components (some finite number). 
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probability that one set of 3n Sheffer-stroke elements malfunctions on 
one particular use is 


Pr (malfunction in one set of elts.) 2 6.4/(n)*- 107897701 


where it is assumed that the probability of error for each use of each 
element is 5.10-°. Therefore, for this system, the probability of mal- 
function decreases exponentially with the redundancy, provided that 
the redundancy is sufficiently large. Other approaches involving the 
use of more complex modules have led to results similar to those 
of von Neumann.? In some cases the resulting network is more ef- 
ficient (less redundant for a given probability of system failure) 
than von Neumann’s network. However, in all cases, to achieve an 
arbitrarily small probability of system failure it is necessary to make 
the redundancy arbitrarily large. 

It is interesting to compare von Neumann’s results with those 
obtained by Shannon concerning the reliability of communication 
systems.? Both show that the probability of error within the system 
can be made arbitrarily small. In the case of communication systems, 
this can be achieved for certain nonzero information rates by choosing 
the constraint length of the code arbitrarily large. The largest infor- 
mation rate for which the probability of error can be made arbitrarily 
small is called the capacity of the communication channel. By making 
an analogy with this result, one might expect that, in the case of a 
computing system, it should be possible to achieve an arbitrarily 
small probability of error for certain bounded values of the redun- 
dancy by choosing the complexity sufficiently large. Extending this 
analogy, the reciprocal of the minimum redundancy for which the 
probability of error can be made arbitrarily small is called the com- 
puting capacity of the computing system. For the systems proposed 
by von Neumann, there is no finite redundancy for which the proba- 
bility of error can be made arbitrarily small; therefore, these systems 
have a computing capacity of zero. 

The question remained whether there existed any method for 
designing a “reliable” system with a nonzero computing capacity. 
Since it was well known that by using suitable coders and decoders 
it is possible to obtain a “reliable” communication system, it was 
natural to attempt to apply coding techniques to the problem of 
designing a “reliable” computing system. One approach is to con- 
sider coding the inputs to each computing component and decoding 
the output. Elias set out to show that this method could not be used 
to design a general computing system with a nonzero computing 
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capacity.* Since Elias desired a negative result, he permitted all 
coders and decoders to be noiseless, and since a general computing 
system must be capable of performing all 16 elementary operations, 
Elias considered 16 types of computing components, each capable 
of performing one of the elementary operations. 

The computing components were divided into two classes. The 
operations performed by components in the first class were those 
represented by matrices containing an odd number of ones and zeros 
and in the second class were those containing an even number. Elias 
showed that components in the first class have a computing capacity 
equal to zero; but that it is possible for components in the second 
class to have a nonzero computing capacity.* 

Unfortunately, the only component in the second class which per- 
forms a nontrivial operation is the modulo-2 adder; furthermore, there 
is no combination of class two components that can perform class 
one operations such as AND and or. Therefore Elias concluded that 
this coding technique could not be used to design a general computing 
system with a nonzero computing capacity. 

More recently Winograd and Cowan proposed another scheme for 
designing a “reliable” computing system.® Their approach was very 
similar to von Neumann’s. However, instead of considering a single 
irredundant network as the precursor for the redundant network, 
Winograd and Cowan considered a composite network consisting of 
k copies of this irredundant network, each computing independently, 
to be their precursor. If the original irredundant network has a com- 
plexity o then this composite network has complexity ko; but its re- 
dundancy is still one since each network is capable of performing 
independent operations on different inputs. 

To introduce redundancy into this composite network, one con- 
siders sets of k network elements, the members of each set being the 
corresponding elements in each of the k precursors. The redundant 
network is formed by replacing each set of k elements with n modules. 
These modules have the property that each of their inputs is encoded 
according to some (n, k) block code, thus allowing each one to per- 
form an error correction operation on each of its inputs. Every set 
of m modules performs the appropriate operations on the corrected 
inputs so that the n binary outputs from the n modules form the 


* To achieve this nonzero computing capacity, it is necessary to assume that the 
complexity of the encoders and decoders grows only linearly with the block 
length of the code as in the case of convolutional coders and sequential 


decoders.5-7 
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code word corresponding to the desired result, namely the code word 
whose information digits are equal to the corresponding k& outputs in 
the original composite network. 

Winograd and Cowan assumed that modular malfunctions are 
statistically independent from one module to another; furthermore, 
they also assumed that for all modules except the “output modules” 
the probability of a modular malfunction is po, independent of the 
operations performed by that module. The “output modules,” those 
modules whose output constitutes the output of the computing system, 
were assumed to be noiseless. To compute a bound on the probability 
of error for this system, each set of n noisy modules can be modeled 
by a set of m noiseless modules, where the output from each module 
is passed through a binary symmetric channel (BSC) with crossover 
probability po. A failure occurs whenever the output from any set of 
n BSC’s is such that a noiseless decoder would be unable to decode 
this word correctly. According to the noisy channel coding theorem,°® 
there exist codes for which the probability of such a failure is bounded 


by 
Pr [modular failure] < e~""™ 


where H(R) > 0 for codes with information rates less than the capac- 
ity of a BSC with crossover probability 9. Since there are at most 
no modules within the network and since each module is used only 
once during the computation, the probability of a malfunction any- 
where within the network during the computation is bounded by 


Pr [failure in network] < no-e"?“” 
which can be made arbitrarily small by making n sufficiently large. 
If the complexity of the modules were fixed, this result would 
imply that the probability of error can be made arbitrarily small by 
making the complexity sufficiently large, while keeping the redundancy 
bounded. Unfortunately, each module must perform encoding and 
decoding which requires a number of operations that grows at least 
linearly with n. Therefore, the complexity of the modules must grow 
at least linearly with n which implies that the redundancy of the 
overall network must grow at least linearly with n rather than being 
bounded as one would have hoped. Therefore, the probability of 
error for the system can be made to approach zero only in the limit 
of infinite redundancy. Hence, Winograd and Cowan’s system also 
has a computing capacity equal to zero. 
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1.3 Error Criteria 


Although the term “probability of error” is used in connection 
with each of the three systems discussed in the previous section, the 
error criterion was different for the different systems. Elias, Wino- 
grad, and Cowan assumed that each input to the computing system 
is uncoded and that the output from the system is also uncoded. 
They assumed that for each set of inputs, the system has one correct 
output defined as the output which would be obtained if the system 
were noiseless. If the actual output differs from the correct output, 
the system has made an error. To obtain a probability of error for the 
system that is not lower bounded by the probability of error for 
the output components, they required that the output components 
be noiseless. 

The necessity for using noiseless output devices arises because of 
the requirement that the output be uncoded. Von Neumann avoided 
this problem by assuming that all inputs and outputs are repeated 
n times. He assumed that the result is “correct” provided that the 
fraction of the outputs which are in error is smal]. Von Neumann’s 
assumption that all inputs and outputs must be repeated is a special 
case of the assumption that all inputs and outputs must be coded 
according to some error correcting code. The latter, more general 
assumption is made in this paper. The only condition that is imposed 
is that both the inputs and the output must be coded according to 
the same code so that computing systems are compatible with each 
other. Since the outputs are coded, there must exist classes of outputs 
corresponding to the decoding equivalence classes of the code. A 
result is considered to be correct provided that it is within the class 
which contains the code word corresponding to the desired result. 

The concept of coded inputs and outputs might, at first, seem 
unrealistic since it implies that the user is capable of performing 
error correcting coding and decoding. However, if we consider the 
case of two people communicating with each other, we observe that 
a very complicated process of coding and decoding takes place. Ap- 
propriate redundancy is introduced not only through the inherent 
redundancy in the language but also through the use of “diversity 
channels” which, in this case, correspond to facial expressions, hand 
and arm movements, voice inflections, and so on. Therefore, since 
there is always an appropriate coding used in the transmission of 
information between individuals, it 1s unrealistic to expect that a 
machine and user could communicate without the use of some type of 
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error correcting procedure. In fact, the condition that all information 
must be coded is a necessary requirement for the reliability of a 
computing system in which every component is noisy. 


1.4 Synopsis 

It is our ultimate objective to show that it is possible to construct 
from unreliable components a reliable computing system where a 
reliable system is defined as a type of system which has a nonzero 
computing capacity. For the first part of the analysis, it is assumed 
that component errors are statistically independent from one compo- 
nent to another and from one use of a particular component to another 
use. A computing system constructed from components of this type 
is called a noisy computing system. A virtually identical analysis 
and similar results apply in the case where components within the 
system fail permanently but where periodic maintenance is performed 
on the system; that is, at regularly spaced times, components which 
have failed are replaced by good ones. In this paper we restrict our 
attention to memories. It is shown that information can be stored 
reliably within a “stable memory,” a device constructed entirely 
from unreliable components. The paper following this shows that it 
is possible to design a computing system, using unreliable components, 
that performs operations reliably on information stored within stable 
memories, 


II. STABILITY 


The remainder of this paper is concerned with reliable information 
storage in memories constructed from unreliable components. A mem- 
ory is a device in which information is stored at one time and recov- 
ered at some later time. If a memory is to be useful, it must have 
two important characteristics. First, it should be possible both to 
store information in the memory and to read information from the 
memory at any time specified by the user, or at least at any one of 
a set of discrete times where the members of the set are closely 
spaced. All memories considered in this paper can have information 
stored in them at any time and retrieved from them at any time. 
Second, the information read out of the memory must be identical 
to or at least equivalent to the information originally stored. With 
memories constructed entirely from unreliable components, it is un- 
reasonable to expect that the word read out of the memory will be 
identical to the word stored; however, we can hope that the informa- 
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tion will be preserved. To clarify this idea of information preservation 
we introduce the concept of stability. 


2.1 The Concept 


To illustrate the meaning of stability, let us consider a simple 
memory consisting of one noisy register and a correcting network. 
The state of this memory is defined by the word contained within 
the register. It is assumed that initially (¢ = 0) a code word from 
some error correcting code is read into the register, thus defining the 
memory’s initial state. Since the register is noisy, errors occur in the 
stored digits; hence, the state of the memory is perturbed away from 
the original one. The correcting network monitors the contents of 
the register, performs error correction, and inserts into the register 
an estimate of the original code word. If there were no correcting 
network, the state of the memory would “wander away” from the 
original one; however, the correcting network provides a “restoring 
force” which tends to bring the state of the device back to the original. 
The noise may perturb the state of the device beyond the error cor- 
recting capability of the correcting network. If this happens, we say 
that a memory failure has occurred. To define a memory failure more 
precisely, it is necessary to associate with the different input code 
words disjoint classes of states. As long as the state of the memory 
remains within the appropriate class, no memory failure has occurred. 

The redundancy of a memory is defined to be the ratio of the complex- 
ity of the memory to the complexity of an irredundant memory which 
has the same information storage capability. The inputs to the memory 
being considered are code words from an (n, k) block code; hence, the 
memory has a storage capability of k bits. This memory is denoted by 
M,. An irredundant memory with the same information storage cap- 
ability as 17, would have a complexity equal to k, since it would consist 
of k one-bit information storage components. If we consider k to be a 
variable, we can speak of a sequence of memories where M, is a typical 
member of the sequence. If the complexity of every memory in this 
sequence is less than a-k, the redundancy of any memory in the sequence 
must be less than a which, by assumption, is independent of k. Hopefully, 
for any 7’, it is possible to make the probability of a memory failure 
during the time interval 0 $ ¢ S$ T arbitrarily small by choosing k 
sufficiently large while keeping a bounded. If the sequence of memories 
has this property, we say that the sequence is stable. For convenience 
we refer to a typical member of a stable sequence of memories as a 
stable memory. Here is a more concise definition of stability. 
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2.2 The Definition — 

Consider a sequence of memories denoted by {M;,}. The memories 
in this sequence are ordered according to their information storage 
capability; that is, the kth memory, M;, has a storage capability of 
k bits. The sequence {M;} is called stable if it satisfies the following 
conditions: 

(i) For any k, M, must have 2* allowed inputs which are denoted 
by {Tic}, 0 <i < 2. 

(it) With each input there must be associated a class of states of 
M,. The classes associated with different inputs must be disjoint. 
For any k and 7, the class of states corresponding to J; is denoted by 
C (Tui). 

(iit) The complexity of M;, must be bounded by a:k where a is a 
fixed parameter for any particular sequence. 

(tv) Suppose that at ¢ = 0 any one of the allowed inputs is trans- 
mitted to each memory in the sequence. Let there be no inputs for all 
t > 0. Consider a typical memory M;,. Denote the particular input 
that was transmitted to M;, by I,,. The probability that the state of 
M,, does not belong to C(I,;) at t = T is denoted by pxi(T) and max; 
[pui(T’) ] is denoted by p;(7'). If the sequence is stable then, for any 
T > 0 and 8 > 0, there must be a k such that p;,(T) < 8. 


2.3 Examples of Memories 

To further clarify the meaning of stability, consider two types 
of memories. The first consists of a noisy register without any cor- 
recting network and the second consists of a noisy register with a 
noiseless correcting network. In light of the discussion in Section 2.1, 
we do not expect that memories of the first type can be stable whereas 
we do expect that memories of the second type can be. These expecta- 
tions are correct. 

Consider first a sequence of noisy registers. A typical member of this 
sequence is a register containing n binary digits which define the 
register’s state. Let p denote the probability that any particular digit 
stored in the register is changed during a time interval r. If one is 
interested in the contents of the register only at the times t = 0, 7, 27, 
37, ++ , a model for the noisy register is the noiseless register together 
with the nm binary symmetric channels shown in Fig. 2. The allowed 
inputs to this noisy register, 31, , are the 2" code words from some (n, k) 
code. The class of states corresponding to a particular input is the decod- 
ing equivalence class corresponding to that code word. The complexity 
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Fig. 2— Model for a noisy register. The digits in the noiseless register are 
transmitted through the BSC’s once every 7 seconds. 


of Wt, isn = k/R, where R is the information rate of the code; therefore, 
the appropriate value of a for this sequence is 1/R. 

Suppose that at é = 0 one of the allowed inputs is transmitted to 1%, . 
At t = 7 the probability of error per digit is p. The probability of a 
memory failure at t = + equals the probability that the word in the 
register at ¢ = 7 does not belong to the decoding equivalence class con- 
taining the original input. Provided that the code is at least as ‘‘good” 
as an average random code, the noisy channel coding theorem” states 
that p.(r) is bounded by 


p(t) S exp — [(k/R) E(R)] 


where H(R) is positive for R < 1— H(p). [1 — H(p)] is the capacity 
of a BSC with crossover probability p. Since the probability of failure 
is independent of the particular input it is unnecessary to perform 
the maximization over all inputs. 

Next consider the state of the register at time ft = T = Lr. Accord- 
ing to the model in Fig. 2, to determine the state of the noisy register 
at ¢ = Lr we must transmit the n binary digits through their respective 
BSC’s LZ times. This is equivalent to transmitting each binary digit 
through LZ BSC’s in series. Since the overall capacity of these chan- 
nels in series decreases asymptotically to zero as L increases, for any 
fixed rate there must exist some LZ for which the capacity of the L 
channels in series is less than R. Therefore, for any sequence of noisy 
registers with bounded redundancy (fixed « or &), there must exist 
some LZ (or 7) such that there is no register in the sequence which 
has a sufficiently small probability of failure to satisfy the require- 
ments for stability. Thus, as would be expected, noisy registers are 
not stable. 

As a second example, consider the same sequence of noisy registers 
but this time associate with each register a noiseless correcting net- 
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work, Within each time interval of length 7, this correcting network 
takes the output from the n BSC’s and maps this vector onto a code 
word which is then inserted into the register to define its new state. 
This type of device is shown in Fig. 3. The operation performed by 
the correcting network is equivalent to that performed by a noiseless 
decoder followed by a noiseless encoder. This operation can be per- 
formed by a correcting network whose complexity is proportional 
to k; for example, a noiseless sequential decoder followed by a noise- 
less convolutional encoder.*-? If such a correcting network.is used, 
the redundancy is independent of k and therefore can be bounded 
for all k. The probability of a memory failure at ¢ = 7 is again upper 
bounded by exp — [(k/R) E(R)] but the probability of a memory 
failure at ¢ = Lr is now bounded by 


pi(Lr) < L-exp — [(k/R) E(R)] 


according to the union bound. Therefore, for any finite L, p,(Lr) 
approaches zero as k approaches infinity provided that R < 1 — H(p). 
This shows that a noisy register with a noiseless correcting network 
can be stable. 


Ill. THE STABILITY OF MEMORIES CONSTRUCTED ENTIRELY FROM 
UNRELIABLE COMPONENTS 


We now show that a noisy register with a noisy correcting network 
can be stable. This proof of stability is extended to memories in which 
components fail permanently but where the components which have 
failed are periodically replaced. 
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Fig. 3— Noisy register with noiseless correcting network. 
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3.1 The Importance of Low-Density Parity-Check Codes 

The memories to be considered store information in the form of 
code words from an error correcting code. Each memory consists of 
one or more noisy registers and a noisy correcting network that 
performs operations which are very similar to those performed by a 
decoder. It is our objective to show that noisy memories of this type 
can be stable. Since the complexity of a stable memory is required 
to be bounded by a:k, where k is the information storage capability 
and a is a proportionality factor which does not depend on k, the 
complexity of the correcting network in any stable memory can 
grow only linearly with k. There are only two kinds of correcting 
networks (decoders) known to have this property. One is a correcting 
network based on a sequential decoder® 7 and the other is a correcting 
network based on a low-density parity-check decoder.?° 

In deciding whether a particular correcting (decoding) procedure 
is suitable for use in a noisy correcting network, one must consider 
whether there are any essential steps in the procedure which could 
not be performed with a small probability of error by a noisy device. 
For example, almost all parity-check decoders are required to com- 
pute the modulo-2 sum of a set of digits where the number of digits 
in the set is proportional to the constraint length, N, of the code. 

To compute the probability that a noisy device makes an error 
in performing this operation, consider the noisy addition network 
shown in Fig. 4. This network, consisting of N — 1 adders (modulo-2), 
computes the modulo-2 sum of N binary inputs. Let us assume that 
each adder in the network has a probability of error p, and that adder 
errors are statistically independent from one adder to another and 
from one use of a particular adder to another. The output of the noisy 
addition network will be in error if an odd number of adders make 
errors. It can be shown? that the probability of this event equals 


1 — (1 — 2p,)""*_ 


Prob of error = 5 


This probability approaches 14, exponentially with NV, as N approaches 
infinity. 

The memories under consideration store coded information. To 
make the probability of a memory failure arbitrarily small (as re- 
quired for stability), one would expect that it would be necessary to 
make the constraint length of the code arbitrarily large. Therefore, 
the noisy correcting network must be able to perform error correction 
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Fig. 4— Modulo-2 addition network. © represents one adder (modulo-2). 


even if the constraint length of the code is very long. This implies 
that no correcting procedure involving a modulo-2 addition operation 
of the type just described is suitable for use in a noisy correcting 
network. For example, the correcting network based on a sequential 
decoder must generate hypothesized branches of a code tree. Each 
digit on one of these branches is computed by forming the modulo-2 
sum of a set of digits where the number of digits in the set is propor- 
tional to the constraint length of the code. The probability that a 
noiseless sequential decoder makes an error can be made arbitrarily 
small only if this constraint length is made arbitrarily large. But 
making the constraint length arbitrarily large makes the probability 
of an addition error within the noisy decoder arbitrarily close to 14. 
Therefore, a noisy correcting network should not be based on a se- 
quential decoder. 

Fortunately the correcting network based on a low-density parity- 
check decoder does not have this problem. A low-density parity-check 
decoder does evaluate parity checks, modulo-2 sums of the digits in 
parity-check sets; however, the number of digits in each parity-check 
set is not a function of the block length of the code. 


3.2 The Correcting Algorithm 

The memories to be considered consist of several registers and a 
correcting network and they store information in the form of code 
words from a low-density parity-check code. It is our objective to 
show that memories of this type can be stable. The first step is to 
consider the details of the correcting algorithm. This requires a brief 
explanation of low-density parity-check codes. 
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An (N, J, K) low-density parity-check code is defined to be a code 
with block length N such that there are K digits in each parity-check 
set and J parity-check sets containing any particular digit. Such a 
code can be represented by a parity-check matrix which has K ones 
in each row and J ones in each column. For example, an (N = 20, 
J = 4, K = 5)* low-density parity-check matrix is shown in Fig. 5. 
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Ooroocoocrcoeoorcoore 
RKOOOOCOCFCCOCOFROCrFOCO 
COOrcCcOrFCCOrFCOCOrFOoSo 
COrMOCOrFrCCOrFrcCOCOOCOrFrOCoO 
COrFCOCOOCrFrCOCrCOrFROCSO 
COOOrRrFOOCOFCOCOCOFOO 
COOOrFOCrFOCOCOCOrFRFOoCo 
COOFCOCOFCOFOCOrFOCO 
rFOoooooorcoororceoeo 
OroorcocoeocoorFrcoeoo 
COrFCOrFRCOrFRCOCCrOOCSO 


day da da dy dy2 dee die dys dz d3s dy di diz dis du dog ds ds dog ds 


Fig. 5— The parity-check matrix for a (20, 4, 5) low-density parity-check code. 
The digit positions, denoted by d with appropriate subscripts, are numbered in 
the way described in Fig. 6. 


Gallager has described two schemes for decoding low-density parity- 
check codes, both of which are iterative.t° However, we shall only be 
concerned with the simpler one. Each iteration of this scheme consists 
of first computing all the parity checks and then changing the value 
of any digit that is contained in more than a certain fixed number of 
unsatisfied parity-check constraints (if a parity check equals one, 
the corresponding parity-check constraint is unsatisfied). Provided 
that there were not too many errors initially, each successive itera- 
tion decreases the number of digits in error and, eventually, all parity 
check constraints are satisfied indicating that the resulting word is 
a code word. 

To illustrate how this method works, let us suppose that the digit 
dy is in error but that all other digits are correct. In this case, all 
parity-check constraints involving do will be violated, whereas at 
most one parity-check constraint involving any other digit will be 
violated. Therefore, dy will be changed whereas all other digits will 
be unchanged. In this way the digit dy will be corrected. If there are 


* Notice that K is always greater than J. 
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errors among the digits used to check dy, this digit may not be cor- 
rected during the first iteration. However, after one or more iterations 
sufficiently many of these errors may have been corrected to allow 
dy to be corrected also. 

This simple decoding algorithm does have one problem. Recall that 
for any digit do, the original values of all the other digits in the J 
parity-check sets containing dp are involved in the determination of 
the new value of dy and similarly that the original value of dp is used 
in computing the new value of each digit in these J parity-check sets. 
This means that on successive iterations the values of the digits 
involved in making a new estimate of dy depend on the previous 
estimate of do. This leads to a complex interrelation between the 
errors which degrades the decoder’s performance and, needless to say, 
greatly complicates its analysis. Fortunately Gallager has suggested 
a way to modify the algorithm which at least partially solves this 
problem. Using this modified algorithm, J estimates of each digit are 
made, each one being computed using a different combination of J — I 
out of the J parity-check sets containing the digit to be estimated. 
The value of a particular estimate is changed if J/2 or more out of 
the J — 1 parity checks are equal to one. 

To construct a correcting network based on this algorithm, one 
starts with J registers of length N. Although the assignment of digit 
estimates to register locations can be arbitrary, one would probably 
choose to assign one estimate of each of the N digits in a code word 
to the corresponding N locations in each register. Since each estimate 
of a digit, say do, is to be made on the basis of J — 1 parity-check 
sets, each containing K — 1 digits other than do, there must be (J — 1) 
-(K — 1) other digits interconnected with the input to do. Using the 
modified algorithm, it is necessary to specify not only the digits to 
be interconnected but also the appropriate estimate of each digit. 
For example, consider the parity check set (do, di1, diz, dis, dis). The 
appropriate estimate of the digit d1, to be used in correcting dp is that 
estimate based on the J — 1 parity check sets which omit the one 
containing dy. This estimate is used so that the values of the digits 
involved in computing the new estimate of dy do not, themselves, 
depend on a previous estimate of do. 

This correcting network performs many iterations. During the 
first iteration each digit is estimated on the basis of the (J — 1) 
-(K — 1) digits to which it is interconnected. Since, during the second 
iteration, the same operations are performed, the resulting second 
estimate of each digit depends on the first estimates of these (J — 1) 
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-(K — 1) interconnected digits which, in turn, depend on the initial 
estimates of a much larger set of digits. The sets of digits involved 
in making successive estimates of some digit, say do, can be repre- 
sented by means of parity-check set trees of the type shown in Fig. 6. 
The branches rising from dy represent one set of J — 1 parity-checks 
containing this digit. The interconnected nodes on the first tier of this 
tree represent the digits, other than do, in one of these parity-check 
sets. Each digit on the first tier of the tree is also contained in J — 1 
other parity-check sets. These other parity-check sets are represented 
by the branches rising from the first tier to the second tier of the tree. 
This parity-check set tree can be extended indefinitely. The structure 
of the tree, beyond the first tier, is completely specified by the parity- 
check sets of the codes which, in turn, are specified by the parity-check 
matrix. 

To see how the tree represents the digits involved in estimating 
dy, let us suppose that the decoder has performed 2 iterations. During 
each iteration the digits on each tier of the tree are used to estimate 
the digits on the tier immediately below. Hence, after 2 iterations, the 
value of each digit, particularly dy, has been influenced by the values 
of the digits on the first 2 tiers above it. 

If a parity-check set tree is extended for many tiers, eventually 
some digit will appear in two different places in the tree. If the first 
repetition within any of the J-N trees occurs on the (m + 1)th tier, 


AUINTANVANTE AVUANUANUNTE VEEN ENUENTS 
We AL AU AMAA A AEA AA 


TIER 2 


AEG GE AP NETS SIP Nee Re OP Aig 
\/ \ WW WW WW OM \) WW AW OAV 


TIER 1 
dy dis dia FP) dz das 7 dar daa das aa 
dia da, doy 


do 


do + dir + dio + diz + dy 
do + dar + dee + des + do 
dy + dai + dz + dss + du 
do + dar + dang + dys + dug = C4 


Wedd 
° 


Fig. 6— Two representations for the parity-check constraints involving do. At 
the top is one of the J parity-check set trees rising from do. Beneath it are the 
parity-check equations containing do. 
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then, according to Gallager’s nomenclature, the code has m independent 
iterations. Notice that this number, m, is a parameter of the code 
which is unrelated either to the statistics of the noise or to the par- 
ticular decoding algorithm. This number plays a particularly impor- 
tant part in the analysis of the correcting procedure as it is shown 
that errors in specific sets of digits are statistically independent 
during these first m iterations. 

The physical configuration of the memory is shown in Fig. 7. 
Within the correcting network there are J-N identical sets of com- 
ponents. Each set of components performs the operations required to 
estimate one particular digit. Let us consider a set of components 
which computes estimates of the digit dp. The first operation that 
must be performed by this set of components, the computation of the 
J—1 parity checks rising from do, requires (J—1)+-(—1) two-input 
binary modulo-2 adders. The second operation, deciding whether the 
digit dy) should be changed, can be performed by a decision device 
(threshold device) the output of which is a 1 if dp is to be changed 
and a 0 if dp is not to be changed. Finally, the output of this decision 
device must be added modulo-2 to the previous estimate of dy, the 
operation requiring one binary adder (modulo-2). Similar operations 
are performed to obtain estimates of all J-N digits. These operations 
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Fig. 7— Physical configuration of a memory based on a low-density parity- 
check decoder. 
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constitute one correcting cycle (iteration). The correcting network 
performs correcting cycles continuously once information has been 
read into the registers. 

Finally, let us consider the situations which cause the estimate of 
the digit dp to be in error after a particular iteration. Since any digit 
dy which is in error will be corrected only if a sufficient number of the 
parity checks containing dp are equal to 1, ideally, we would like 
every parity check containing dy to equal 1 if do is in error but to 
equal 0 if dp is correct. If a parity check does not equal the desired 
value we say that the parity check is in error. Thus a parity check 
error depends on errors made by the adders and errors in the digits 
involved in estimating dy, but not on the value of do itself. 

To simplify the mathematical analysis, we restrict our attention 
to the class of low-density parity-check codes with J = 2l, 1 = 2, 3, 4, 

. and the correcting algorithm stated previously. A set of events 
each one of which alone leads to an error (indicated by e) in the esti- 
mate of the digit dp after a particular iteration are: 


(i) do = e after previous iteration and J/2 or more parity checks are 
in error. 
(it) dy ¥ ¢ after previous iteration and J/2 or more parity checks are 
in error. 
(iit) The decision (threshold) device makes an error. 
The first two conditions demonstrate an interesting property of this 
class of low-density parity-check codes. For this class of codes the 
conditions leading to an error in any digit after any iteration are the 
same whether or not the digit was in error before the iteration. This 
property will help to simplify the following mathematical analysis. 


3.3 The Stability Theorem for Noisy Memories 
Theorem 1: There is a stable sequence of noisy memories where every 
component in every memory has a fixed, nonzero probability of error 
per use. 
Proof: The memories under consideration consist of J registers of 
length N, a noisy correcting network based on a low-density parity- 
check decoder, and a set of communication channels over which the 
inputs are transmitted. The definition of stability given in Section 2.2 
includes specific conditions that must be satisfied by stable memories. 
To prove that the memories under consideration are stable, we must 
show that they satisfy all these conditions. _ 

The definition of stability requires that there be a set of. allowed 
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inputs for each memory. Each allowed input for a memory of the type 
under consideration consists of J copies of a code word from some 
(N, J, K) low-density parity-check code. If R equals the information 
rate of the code, the memory having registers of length N has 2?" * 2" 
allowed inputs. This memory is denoted by M, . To define a sequence of 
memories, we allow k(and NV) to be a variable but keep the parameters 
of the code, J and K, fixed.* Let the 2° allowed inputs to M, be denoted 
by {J.;} 0 < 7S 2". With each input we must associate a class of states 
of M,. The classes to be used are essentially the decoding equivalence 
classes of the code. To be more precise, the classes can be described in 
terms of the equilibrium states of MM, , denoted by {Z,;} 0 < i S 2", 
where an equilibrium state is one in which every register contains one 
and the same code word. With each input, /,; , there is a corresponding 
equilibrium state, £,;, in which the registers contain the code words 
represented by J,;. The state of M, belongs to the class C(J,,) if a 
noiseless correcting network (that is, a noiseless low-density parity-check 
decoder) could correct all the errors; in other words, if its final state 
would be the equilibrium state, E,, . 

The definition of stability requires that for each sequence of mem- 
ories there must be an a such that for every & the complexity of M;, is 
bounded by ak. The complexity of the correcting network in M;,, is 
computed in Section 3.2. If each of the J registers in M; is of length 
N, the correcting network must contain [1 + (J—1) (K—1)] J-N bi- 
nary adders (modulo-2) and J-N decision devices. Since a decision 
device must determine whether J/2 or more out of the J—1 parity 
checks are in error, the complexity of each decision device depends 
only on the code parameter, J, which is fixed for any particular se- 
quence of memories. Thus the complexity of each decision device, 
denoted by D, is independent of k. Finally, the noisy registers within 
the memory must contain J:-N storage components. Therefore the 
complexity of M;, is: 


Complexity of M, = [2+ D+(J— DK —-Dj]J-N 
Bene ape =a 
= So ge ee 


Since Gallager’ has shown that R = 1 — J/K, the complexity propor- 
tionality factor for the sequence of memories under consideration is 
upper bounded by 


_* There are low-density parity-check codes for all values of N which are 
integer multiples of K.1° 
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< REDEV = WK = DW 
oe 1-— J/K 
Every component has a nonzero probability of making an error. 
The error probabilities are denoted: 





p, = probability that one particular binary digit within the register 
changes during a time interval 7, the time required for one 
correcting cycle. 

Pa = probability that an adder makes an error on any one use. 

pa = probability that a decision device makes an error on any one use. 

p. = probability of an error in transmitting a digit across a communi- 

cation channel. 


It is assumed that errors are statistically independent from one com- 
ponent to another and from one use of a particular component to another 
use. 

To prove that the information storage devices in question are stable, 
it must be shown that for any £, the probability of a memory failure 
in M,, during the time interval 0 < t S £7 can be made arbitrarily small 
by choosing k sufficiently large. In the remainder of this section, the 
probability of a memory failure for the typical device, M,, is upper 
bounded. To determine whether a memory failure has occurred at some 
particular time we use the following algorithm. 

Imagine that M;, becomes noiseless at that time and that the noise- 
less correcting network within M;, performs m more iterations where 
m is the “number of independent iterations” described in Section 3.2. 
These are precisely the same operations that a noiseless low-density 
parity-check decoder would perform. If the final state of the hypothet- 
ical noiseless memory is error free (that is, equals the equilibrium 
state corresponding to the original input) then, by definition, no 
memory failure occurred. Thus, to bound the probability of a memory 
failure we must bound the probability that the final error pattern is 
not error free. 

In general, the error pattern within the registers of M;, depends on 
all the component errors that have occurred since the original input 
was transmitted to the memory. However, if certain conditions are 
satisfied, the error pattern is a function of only the component errors 
that occurred during the most recent m iterations. If no component 
errors occur during these m iterations and if the required conditions 
are satisfied, the final error pattern will be error free. On the other 
hand, if these conditions are not satisfied we say that a propagation 
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failure has occurred. Therefore, to bound (upper bound) the prob- 
ability of a memory failure, we bound the probability that a propaga- 
tion failure occurs on the mth noiseless iteration of the hypothetical 
noiseless memory. 

The first step is to show that, in the absence of a propagation failure, 
the probability of error for digits stored in the memory (the probability 
of error per digit) has an upper bound, denoted by pp , such that po < 3. 
This can be seen intuitively by comparing the performance of a noisy 
corrector with that of a noiseless corrector, that is, a noiseless low- 
density parity-check decoder. Provided that the initial probability of 
error is not too large, a noiseless corrector decreases the probability of 
error per digit with each iteration until this probability reaches zero. 
If the probability of error for the components within the noisy corrector 
is small, compared with the initial probability of error per digit, for the 
first few iterations the noisy corrector decreases the probability of error 
per digit just as the noiseless one did. However, eventually this proba- 
bility reaches the same order of magnitude as the probability that the 
noisy corrector itself makes an error at which time the probability of 
error per digit reaches an equilibrium value. Notice that although such 
an equilibrium value is attained, it is still possible for errors to occur in 
sufficient digits so that a memory failure results. If a memory failure 
does occur, a propagation failure will also occur and the bound on the 
probability of error per digit will no longer be valid. 

In light of this intuitive argument, we expect that the time at which 
the probability of error per digit is at its maximum value is just 
before the end of the first correcting cycle. In Appendix A upper 
bounds are computed on this probability evaluated just before the 
end of the first and successive correcting cycles. It is shown that, 
provided no propagation failure occurs, these bounds form a mono- 
tonically decreasing sequence; hence, the bound on this probability 
evaluated just before the end of the first correcting cycle, denoted 
by po, is the desired bound. 

The next step is to bound the probability that the initial propaga- 
tion failure occurs at some particular time. A propagation failure 
occurs whenever the error pattern in the memory is related to the 
component errors that occurred m or more iterations previously. In 
most cases the error pattern depends only on component errors that 
occurred in the last few iterations since any digit errors caused by 
previous component errors would have already been corrected. In 
order for a propagation failure to occur, the effect of component er- 
rors must have propagated from one iteration to the next for at least 
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m iterations. Thus we expect that the probability of such a propaga- 
tion decreases as m is increased and, since increasing k increases ™, 
that the probability of a propagation failure can be made arbitrarily 
small by making k sufficiently large. The explicit relationship be- 
tween the probability of the initial propagation failure and k is 
derived in Appendix B. The result is 


Pr [initial propagation failure] < Ck7°** 
where C and 8 depend only on the parameters of the code (J and K) 
and the bound on the probability of error per digit, pp); hence, C and 
B are constants for any particular sequence of memories. 

Some typical values for 8 are shown in Table I. For example, if 
J = 14,K = 15, and pp = 10-8 then B = 7.55; therefore, in this case, 
the probability that the first propagation failure occurs at some 
particular time is upper bounded by a function that decreases as the 
sixth power of the information storage capability of the memory. By 
choosing the information storage capability sufficiently large, this 
probability can be made arbitrarily small. 

For the moment, let us assume that neither a memory failure nor a 
propagation failure has occurred within M, during the time interval 
0 St < &’r. We now use the bound on the probability of the initial 
propagation failure to find an upper bound on the probability that either 
the initial memory failure or the initial propagation failure occurs at 
t = £&’r. To determine whether the initial memory failure occurs at 
t = S£’r we must imagine that M, becomes noiseless at t = £’r and that 
the noiseless correcting network within 1, performs m more iterations. 
As explained previously, the initial memory failure can occur até = &’r 
only if the initial propagation failure occurs at t = £&’r or during the 
m noiseless iterations performed after t = S&’r. Thus the sum of the 


TasLEe [—Typicau VaALuEs oF B’(J, K, po) AND 


BU, K, Po): 
R 
J K (ower bound) po B B’ 
4 5 0.20 1078 2.66 66 
6 7 0.14 10-8 3.91 1.91 
8 9 11 10-8 95 2.95 
10 11 09 10-8 89 3.89 
12 13 0.07 10-8 6.75 4.75 
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probabilities that the initial propagation failure occurs at t = £&’r, 
(£’ + 1)7, --- , (&’ + m)r is an upper bound on the desired probability. 
Therefore, 


Pr [initial memory failure or initial propagation failure 
att = £'r] < (m+ 1)-C-k**?. 
Gallager’® has shown that 


log N 
< jos (7 — DK — Ny 
Therefore, 
mee Dei oe ay 
os | oe 
+1 


< Tog (J — DK — D) 


< log [iz if ae ay 
(Note: K > J 2 4). 


The probability that the initial memory failure occurs during the time 
interval 0 $ ¢ S £7 is upper bounded by the sum of the probabilities 
that either the initial memory failure or the initial propagation failure 
occurs at t = 0,7, 27, --- , &r. This bound equals 


Lr] 


Pr [failure during time interval 0 < t S 
CO. —B+1 
< (¢ + 0)-C-log | ~— Els 
Caste 
< {f+ J) +7 = UIK k 
=(©4+1-C'K" 
where 
ph C pA mS 
OS Sk and B’ = B— 2. 
To show that there is a stable sequence of noisy memories, we must 


show that it is possible to choose J, K, and po such that fp’ > 0. Re- 
call that when we speak of a particular sequence of memories, {M;}, 
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we mean that k is a variable whereas the values of J, K, and the 
probabilities of component errors are all fixed. Certain conditions have 
already been imposed on J, K and po. These conditions are: 


GQ: £22; VS 2,8p4s 
(i) K>J 
(iii) po > 2p, + dz 


(iv) = Dy > @ ie — 1)(po + pa)” + pa + py 


(v) Po > Poa 


where pa, Pa, Pr and p, must all be fixed and greater than zero. To 
demonstrate that there are values of J, K and po which satisfy these 
conditions and for which p’ > 0, consider an example where p, = pa = 
Dr = De = 10°° and where p, = 107°. For this example conditions (ii), 
(iv), and (v) are satisfied for all reasonable values of J and K (that is, 
where J < K < p>’). The values of 6’(J, K, po = 107°), which corre- 
spond to some typical values of J and K, are shown in Table I. For all 
the values of J and K which are considered, the value of @’ is greater 
than zero. Therefore, in all these cases, the probability of a memory 
failure in M, at ¢ = Sr can be made arbitrarily small by making k 
sufficiently large. This proves that there are stable sequences of noisy 
memories. 


3.4 Stability of Memories Constructed from Failure-Prone Components 

Thus far we have restricted our attention to memories in which 
component malfunctions are assumed to be statistically independent 
both from one component to another and from one use of a particular 
component to another use. These assumptions form the basis of a 
mathematical model for the component malfunctions that are com- 
monly attributed to ‘noise’ in the system. Unfortunately, the model 
does not adequately represent the most common type of component 
malfunction that one finds in computing systems: malfunctions where 
individual components fail permanently. 

To see whether memories of the type considered in the previous 
section can be stable, if their components fail permanently, let us 
recall the proof of the stability theorem for noisy memories. In carry- 
ing out this proof, it is necessary to show that there are types of 
memories for which the probability of error per digit can be bounded 
(see Appendix A). If one attempts to find memories in which the 


2324 THE BELL SYSTEM TECHNICAL JOURNAL, DECEMBER 1968 


components fail permanently and for which a similar bound on the 
probability of error per digit exists, it becomes clear that memories 
constructed from these components cannot have such a time inde- 
pendent bound of value less than 14. This is because the probability 
that any particular component has failed increases with time given 
that components fail permanently; hence, if one hypothesizes any 
bound on the probability of error per digit which is less than 1, it is 
always possible to find a time at which the hypothesized bound is 
violated, showing that such a bound cannot exist. By using argu- 
ments such as those in Section 2.3, one can obtain a statistical model 
for the errors after each correcting cycle in terms of an equivalent 
channel whose capacity decreases with time. Since the capacity de- 
creases, it is always possible to find a time at which the capacity of 
the equivalent channel is below the information rate of the code used 
in storing the information in the memory, thus precluding the pos- 
sibility of effective error correction and hence the possibility of 
stability. 

Fortunately, in most “nonspace” applications, regular maintenance 
is performed on computing systems, that is, components which have 
failed are periodically replaced with good ones. Numerous specific 
failure probability distributions and maintenance schemes could be 
considered individually; however, for the purpose of this analysis 
we consider instead a general case which includes many of the com- 
mon probability distributions and maintenance schemes. This gen- 
eral case is the one for which it is possible to upper bound, during 
each correcting cycle, the probability that each component has failed 
up to or during that correcting cycle. For example, suppose that each 
component is replaced every 7’ seconds and that p, represents the 
probability that any particular component initially fails during any 
particular correcting cycle. For this example, the desired upper bound 
on the probability of component failure equals T’-p; which, by ap- 
propriate choice of T and p;, can be made less than 14. Notice that 
we are free to choose both T and p; since, as before, we are only try- 
ing to show that there exists some memory of the type under consid- 
eration which is stable. 

One can now perform an analysis identical to that performed in 
the previous section. Since the technique used to prove the stability 
theorem for noisy memories does not rely upon the assumption that 
component errors are statistically independent from use to use, pre- 
cisely the same technique used previously can be used here to prove 
that periodically maintained memories can be stable. In fact, in most 
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cases, the changes required to make this proof apply when compo- 
nents fail permanently merely involve replacing the words “com- 
ponent error” with the words “component failure.” 

One change which requires some reinterpretation of terms involves 
the concept of a propagation failure. This concept was introduced for 
the purpose of establishing a condition under which the parity checks 
used to estimate any particular digit would be conditionally inde- 
pendent. To facilitate an intuitive discussion of propagation failures, 
the original definition of a propagation failure was made more gen- 
eral than necessary for the mathematical analysis. This analysis, in 
Appendix B, uses the fact that undesirable statistical dependencies 
can occur only if the effects of previous component malfunctions form 
a A propagation path in some parity check set tree. It is now desirable 
to redefine a propagation failure in terms of the formation of such a 
A propagation path. This is because permanent component failures 
can result in recurrent errors in a particular digit during m or more 
iterations thus causing a propagation failure, according to the original 
definition. However, since these recurrent errors do not lead to the 
undesirable statistical dependencies unless they also correspond to an 
undesirable A propagation path, the original definition of a propaga- 
tion failure should be changed to exclude recurrent errors. 

Once these changes have been made, if one represents the bounds on 
the probabilities of component failures by the same symbols that 
were used previously to represent the actual probabilities of com- 
ponent errors during each iteration, not only is the method of prov- 
ing the stability theorem identical to that used previously but so 
are the forms of all the results. Thus one proves the following theorem. 


Theorem 2: There is a stable sequence of memories where every com- 
ponent in each of the memories in the sequence has a nonzero prob- 
ability of permanent failure but where components which have failed 
are periodically replaced with good ones. 


IV. CONCLUSIONS 


In Section I we compare the results obtained by Shannon con- 
cerning the reliability of communication systems with the results ob- 
tained by von Neumann, Elias, Winograd, and Cowan concerning the 
reliability of computing systems. Shannon’s results were basically 
different from the other results considered. Shannon was able to show 
that it is possible to design arbitrarily reliable communication sys- 
tems through which information can be transmitted at a nonzero 
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information rate. The maximum rate for which the probability of 
error can be made arbitrarily small is called the capacity of the com- 
munication channel. In analogy with this result, one might expect 
that it should be possible to design arbitrarily reliable computing 
systems which have a bounded redundancy. Unfortunately, none of 
the computing systems proposed previously has this property; there- 
fore, none of these computing systems has a nonzero “computing 
capacity.” 

In Sections II and III we restrict our attention to one part of a 
computing system, namely the memory. It is shown that there are 
noisy memories and periodically repaired memories constructed from 
failure-prone components which have the property that the prob- 
ability of failure can be made arbitrarily small for certain bounded 
values of the redundancy. The memories which have this property 
are called “stable memories.” This result is analogous to Shannon’s 
result and is basically different from the other results obtained thus 
far concerning the reliability of computing systems. The fact that it 
is possible to make a memory arbitrarily reliable while keeping its 
redundancy bounded indicates that a memory has an “information 
storage capacity” analogous to the capacity of a communication 
channel. The information storage capacity of a particular memory 
equals the reciprocal of the minimum redundancy for which the 
memory is stable; hence it can be expressed in bits per component. It 
is a function of the probabilities of error for the components within 
the memory. The method used to prove the stability theorem does 
not allow one to compute an explicit value for the information storage 
capacity of the memories which were considered; however, the fact 
that these memories can be stable indicates that they do have a non- 
zero information storage capacity. 
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APPENDIX A 


A Bound on the Probability of Error Per Digit 


The first step in computing a bound on the probability of a propa- 
gation failure is to bound the probability of error for any digit stored 
in the registers within the memory. We assume that initially one set 
of J code words is transmitted across the noisy channels and inserted 
into these noisy registers. Some time during the next + seconds the 
correcting network reads the contents of the registers and starts to 
perform the first correcting cycle on the newly inserted digits. The 
time at which this first correcting cycle starts is denoted by ¢ = 0 and 
successive correcting cycles start at t = r, 27, 37,.. . . We denote the 
instant just before the end of the first correcting cycle by ¢ = r—8. If a 
digit is in error at t = r—8, at least one of the following events must 
have occurred: 

(z) An error was made in transmitting the digit across the BSC. The 
probability of this event is p, . 

(iz) The digit was changed because of a component error in the register 
which occurred either during the time interval —-7r << tS Oor0 <iS 
7 — 6. The probability of this event is less than 2p, . 

Therefore, by the union bound, the probability of error per digit at 
= r — 6 is bounded by. 


Pr [digit = «ati = 7 — 6] < 2p, + pv. < Do 


where the parameter po has been introduced to simplify the form of 
the results. Other conditions on po will be imposed later. 

Next let us compute a bound on the probability of error per digit 
at ¢ = 27—8, 3r—5,... . If the digit dp is in error at any one of these 
times, at least one of the following events must have occurred: 


(1) A set of J/2 parity checks used to estimate dy were in error 
during the last correcting cycle performed on do. 

(11) The decision device made an error during the last correcting 
cycle. 

(zw) An error occurred while dy was stored in the register. 


There are @ I/2 ') possible events of the first type. We now compute 


the probability that any one of these events occurred. If the zth parity 
check used to estimate d) , c; , were in error there must have been at 
least one error among the K — 1 adders used to evaluate this parity 
check, or at least one error among the K — 1 digits denoted by d;, 


? 
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diz, °** , djx-, . According to the union bound, the probability that c; 
(for any 0 < 7S J — 1) is in error is bounded by 


6) Ge = Apis 


K-1 
Pr [ce; = €] < 2 Pr (di; 
i=1 


e|< poforalOQ << js K-11; 


I 


During the first correcting cycle Pr [d;; 
therefore, for this iteration 


Pr [c; = e€ during first correcting cycle] < (K — 1)p. + (K — 1)p,. 


To compute the probability of error per digit just before the end of 
the second iteration, we use the fact that, during the first m iterations, 
the structure of the code guarantees that errors in the parity checks used 
to estimate any particular digit are statistically independent. To see 
this, consider the parity-check set tree rising from the digit d) as shown 
in Fig. 6. Each node on this tree represents a particular digit. With each 
digit there is associated a set of components used to compute each 
estimate of the digit. Just as the tree represents a history of the digits 
which have been involved in the computation of the successive estimates 
of dy, it also represents a history of the components which have been 
involved in the computation of these estimates. The later interpretation 
is more useful for our purposes since it is the components which cause 
the errors. If the code has m independent iterations, all the digits on the 
first m tiers of the tree must be different and all the components asso- 
ciated with these digits must be different also. There are (K — 1)(J — 1) 
digits on the first tier of this tree. Provided that m = 1, the errors in 
these digits after the first iteration must be statistically independent 
since the digits are all different, and hence the components used to 
compute the estimates of these digits are all different. (It is assumed that 
errors in different components are statistically independent.) In general, 
the errors in the digits on the first tier of the tree, and hence the J — 1 
parity checks, are statistically independent provided that the sets of 
components used to compute the estimates of these digits are disjoint. 
The structure of the tree guarantees that this condition will be satisfied 
for the first m iterations. 

Since, during the first m iterations, the errors in the parity checks 
used to estimate any particular digit are statistically independent, 
the probability that a set of J/2 parity checks is in error equals the 
product of the probabilities that each one of these J/2 parity checks 
is in error. Thus the probability of error per digit at t = 27 — 8 is 
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bounded by 


Pr {digit = e« at t = 27r— 4 
J—1 2 
< (7775 UK = 0 + pa’? + Bs +P, 


4 
= —Pi . 

Since we are not attempting to show that all memories of the type 
under consideration are stable but merely that there exist some mem- 
ories of this type which are stable, we shall restrict our interest to 
those memories for which it is possible to make p, < po. This is not 
a serious restriction since in most cases there is no difficulty in bound- 
ing p; by po. For example, if 


Da) Pr and py = 107°, p» = 107°, J = 14andK = 15, then p, % 2.107° 
illustrating one case where p, < po. 


Precisely the same argument can be used to obtain a bound on the 
probability of error per digit at t = 3r—8, 4r7—5, .. , (m+1)r—8. The 
results are: 


Pr [digit = « at t= 37r— 4] 


A 


ee NK — 1)(~. + pl"? + pa + Dr 


Il> 


P2 <Di <Po- 
Similarly 


Pr [pigit = « at ¢=(m+1)r — §] 
= co. NK — I(pn-1 + pa)” + pa + Dr 


= Dn << Dani — < pi <Po- 


If no propagation failure occurs at ¢ = mr, the error pattern 
evaluated at that time depends on the component errors that occurred 
during the previous m iterations, but not on the original errors that 
were present at t = 0. Imposing this condition changes the probability 
of error per digit at ¢ = (m+1)7—8; however, as we now show, the 
probability computed above is an upper bound on this conditional 
probability. Using Bayes rule, 
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Pr [digit 


HT 


e | no propagation failure] 


Pr [no propagation failure | digit = e]- Pr [digit = «] 


Pr [no propagation failure] 


It is easily shown, using techniques similar to those in Appendix B, 
that 


Pr [no propagation failure | digit = «] < Pr [no propagation failure], 
therefore 


Pr [digit = ¢ | no propagation failure] < Pr [digit = €]. 


If no propagation failure has occurred, the errors in the set of parity 
checks used to estimate any particular digit are conditionally in- 
dependent. This is because, if there is no propagation failure, the er- 
rors in each set of parity checks depend on errors in unrelated, dis- 
joint sets of components. Thus the technique used to bound the 
probability of error per digit during the first m iterations can be ap- 
plied after the mth iteration provided that no propagation failure has 
occurred. The general result is 


Pr [digit = «até = (¢ + 1)7 — &| no propagation failure] 
J—1 a J/2 
< ( I/2 as 1)(pi-1 + pal + pa + 


=p; < pi << <M. 

This monotonically decreasing sequence of bounds shows that the 
probability of error per digit is upper bounded by yo provided that 
it is possible to choose po, J, and K such that p,; < po and provided 
that no propagation failure occurs. In general, the probability of 
error per digit is upper bounded by »p; provided that the memory has 
performed 7 or more correcting cycles and provided that the condi- 
tions stated above are satisfied. 


APPENDIX B 


The Probability of a Propagation Failure 


Intuitively, the concept of a propagation failure is very simple. A 
propagation failure occurs whenever the present error pattern in. the 
registers is in some way related to the component errors that oc- 
curred more than m iterations before, where m is the number of in- 
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dependent iterations. This intuitive concept can be made more pre- 
cise by defining “error configurations” which are hypothetical error 
patterns that are functions of some subset of the actual set of com- 
ponent errors. In particular, a type 7 error configuration, for any 1 
> 0, is a J-N-tuple corresponding to the error pattern that would be 
present if there had been no component errors before the last 7 itera- 
tions. If no propagation failure has occurred, all error configurations 
of type m and higher must be identical, thus we are really interested 
in the difference between error configurations. For this reason, it is 
more convenient to restate the definition of a propagation failure in 
terms of “A configurations” where, for all 7 > 0, the type 1 A configura- 
tion is the difference between the type 7 and type i+1 error configura- 
tions. The type 0 A configuration is defined to be equal to the type 1 
error configuration. A propagation failure occurs if there are one or 
more 1’s in any A configuration of type m or higher. 

Let us consider the situations which, for any 7, lead to a 1 in the 
type 2 A configuration evaluated at some particular time, say t = 
Lr. Suppose that there is a 1 in the position corresponding to the digit 
dy in this A configuration. This means that the value of the digit do 
in the type 7 error configuration is different from that in the type 
1+1 error configuration where both configurations are evaluated at 
t = Lr. This change in the value of dy must be related to component 
errors that occurred during the time interval (L—i—1)r < t < (L—1)r 
since all the other component errors upon which the type z and type 
1+1 error configurations are based are the same. We refer to the com- 
ponent errors that occurred during the time interval (L—1—1)r < t < 
(L—1)r as the controlling errors for the type 2 A configuration eval- 
uated at ¢ = Lr and we say that the value of dy at ¢ = Lr has been 
changed by these controlling errors. 

If the value of dp is changed at ¢ = Lr, the controlling errors must 
have changed at least one of the digits used to estimate dy. These are 
the digits on the first tier of the parity-check set tree rising from do, 
and changes in them are represented by 1’s in the appropriate posi- 
tions in the type 7—1 A configuration evaluated at t = (L—1)r. In gen- 
eral these controlling errors must have caused changes in some digits 
on the /th tier of the parity-check set tree, the changes being rep- 
resented by 1’s in the appropriate positions in the type 71—l A con- 
figuration evaluated at t = (L—1l)r. These changed digits define at least 
one continuous path in the parity-check set tree rising 2 tiers from 
dy. We call these paths A propagation paths. The particular A prop- 
agation path which has just been described is referred to as the 2-tier 
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A propagation path rising from dy at t = Lr (see Fig. 8). Every 1 ina 
type 2 A configuration must have at least one 7-tier A propagation 
path associated with it. To bound the probability that there is a 1 in 
the entry corresponding to the digit dy in the type 7 A configuration 
evaluated at ¢ = Lr, we shall bound the probability that one or more 
i-tier A propagation paths rise from the digit dp at t = Lr. 

Since A propagation paths must be continuous, for each 1 in a type 
1 A configuration evaluated at ¢ = Lr there must have been at least 
one 1 in a type i—1 A configuration evaluated at t = (L—1)r. If no 
propagation failure occurred before ¢ = Lr, the A configurations of 
type m and higher must have beeen all zero for all t < Lr. This im- 
plies that the A configurations of type m+1 and higher must be all 
zero at t = Lr. Hence, the only way that the initial propagation fail- 
ure can occur at ¢ = Lr is if there is a 1 in the type m A configura- 
tion evaluated at that time. Therefore, to bound the probability that 
the first propagation failure occurs at some particular time, we need 


3 ; (L-ti-1)r TIER U+1 
| TIME INTERVAL DURING 
| WHICH CONTROLLING 
ERRORS OCURRED 


ti (L-i)r TIER i 






_-NEW ERROR 


t= (L=e+1)r TIER L-1 
1 __ FOR ANY OSUt, AT 
1 ger Gal) e THERE 18 
ut A‘ IN THE TYPE (i-1) 
A CONFIGURATION IN THE 
t= (L-\)r TIER L POSITION CORRESPONDING 
TO THE CIRCLED DIGIT. 
\ 
\ 
\ 
t=(L-3)r TIER 3 
t = (L-2)¢r TIER 2 
t= (L-1)r TIER | < 
ive a 
do 


Fig. 18— An example of an 7-tier A propagation path rising from do at ¢ = Lr. 
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only bound the probability of one or more 1’s in the type m A con- 
figuration evaluated at that time. Since we assume that the memory 
fails whenever the first propagation failure occurs, we shall never be 
concerned with computing the probability of a 1 in any A configura- 
tion of type m+1 or higher. It is important that we can restrict our 
attention to the type m A configuration since this A configuration can 
be computed for any particular time by considering only the compo- 
nent errors that occurred during the previous m-+1 correcting cycles. 
The first of these correcting cycle results in statistically independent 
digit errors and, as explained in Appendix A, the structure of the 
code guarantees that during the next m correcting cycles the errors in 
the parity checks used to estimate any digit are statistically inde- 
pendent. 

To bound the probability that the initial propagation failure oc- 
curs at ¢ = Lr, we must compute a bound on the probability of one or 
more 1’s in the type m A configuration evaluated at t = Lr. This is 
done by bounding the probability that an m-tier A propagation path 
rises from one or more of the J:N digits in the registers within the 
memory at ¢ = Lr. At first we restrict our attention to one particular 
digit dy. A bound is computed on the probability that an m-tier A 
propagation path rises from dy at t = Lr. The first step in this com- 
putation is to bound the probability that the component errors that 
occurred during the time interval (L—m—1)r < t < (L—m)r would 
cause any particular digit on the mth tier of the parity-check set tree 
rising from dy to be in error at t = (L—m)r. Any m-tier A propaga- 
tion path rising from dy at t = Lr must terminate on one of these er- 
rors which we call new errors. The next step is to bound the prob- 
ability that at ¢ = (Z—m-+1)r an 7-tier A propagation path rises from 
any particular digit on the (m—1)th tier of the parity-check set tree 
rising from dy. This probability is denoted by Pr[dni:=Ai]. By sub- 
stituting m for 2, we obtain a bound on the probability that at ¢ = 
Lr an m-tier A propagation path rises from the digit dy (that is, 
Pr[do = Am]). The probability that an m-tier A propagation path 
rises from one or more of the J-N digits at ¢ = Zr is upper bounded 
by J-N:Pr[do = An]. 

The first step, namely bounding the probability of a new error at 
t = (L—m)r, is particularly simply. In Appendix A we computed a 
bound on the probability of error per digit which was denoted by 7p. 
Since this bound was computed by considering all possible errors that 
could exist at a particular time, it must certainly be a bound on the 
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probability of a new error at some particular time. Therefore, po is a 
bound on the probability that a new error occurs at = (L—m)r. 


The next step is to bound Pr [d,,_-; = A] for all1 S$ 7 S m. Let us 
consider a particular digit on the (m — 17)th tier of the parity-check set 
tree rising from d, and denote this digit by d,,_;. Now consider the 
conditions which must be satisfied if the value of d,,_; is changed by the 
controlling errors; that is, if an 7-tier A propagation path rises from d,,_; 
att = (L —- m+ 24)r. 

To describe this ‘‘change’”’ in more detail, we must consider two sets of 
component errors. One is the set of component errors that occurred 
since t = (L — m — 1)7 and the other is the set of component errors that 
occurred since t = (L — m)z [assume that no component errors occurred 
before t = (L — m — 1)randt = (L — m)r, respectively]. When we say 
that the value of d,,-; has been changed by the controlling errors, we 
mean that the value of d,,_; at ¢ = (LZ — m + 7)r is correct when it is 
computed under the assumption that one of these sets of component 
errors actually occurred, whereas it is incorrect when it is computed 
under the assumption that the other set of errors actually occurred. 
There are two necessary conditions for this change. Assume that the 
value of d,,-; was changed by the controlling errors. Denote the set 
of component errors for which d,z_; = ¢ by §&, and the set for which 
Am-i F € DY Scorrect - If the value of d,,-; is changed by the controlling 
errors, both of the following conditions must be satisfied: 


(t) There must have been at least one parity check used to estimate 
dm-; Which was wrong [at ¢ = (L — m + 7%)7] under the assumption 
that &, occurred but which was correct under the assumption that 
Scorreet occurred. Denote one of these parity checks by cy . 

(zz) On the basis of the errors in the set §, , J/2 — 1 or more parity 
checks other than c, must have been wrong at ¢ = (L — m + 1)r. 

In order for condition 7 to be satisfied, the value of at least one of 
the (J—1) (K—1) digits immediately above d,», in the parity-check 
set tree must have been changed [at ¢ = (L—m-+1—1)r] by the con- 
trolling errors. The probability of such a change has been denoted by 
Pr[dmit1 = Ais]. Therefore, the probability that the value of one or 
more of these digits was changed is upper bounded by 


Pr [condition 7 is satisfied] 


< Pr [value of any digit immediately above d,,; is changed by con- 
trolling errors] 


< (J — 1)(K — 1) Pr [d,-441 = A;-1]. 
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A bound on the probability that condition 2 is satisfied was derived 
in Appendix A. This bound equals 


Pr [condition (77) is satisfied] < ( ie ? Jue = 1)(p. + p.)?". 


As explained previously, the structure of the code guarantees that 
parity-check errors are independent; hence, during the m iterations 
of interest, these two conditions are independent. Therefore, the prob- 
ability that both conditions are satisfied, which is a bound on Pr[dm4 
= Aj], is given by 


Fr ee = A,] < (J = Dik = 1) Pr ducceey = A;-1] 


oe ? Ia = 1)(po + pa)". 


Substituting 1 = 1,2...m, we obtain 


Prddee; 


I 


Al <= DE Del jy” Jl — Doe + pale 


Pr [dn-2 = Ao] < (J — DK — I) Pr [d,,-, = Ai] 


ee : cK = (po + p.)}°?" 
" p(s — 1K - 1) 


hace Dine + pov} 


Pr [do = An] < pad =I =A) 


ae ? lax - 1)(po + pa)]”’ ae 


Gallager has found a technique for constructing low-density parity- 
check codes*® with m, the number of independent iterations, bounded 


by 


i |x = ea | 
°8 19K Q(K — 1) 
2 log (J — 1I)(K — 1] ~ 


_ log Ne 
~ log (J — 1)(K — 1] 


A 


m 
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Substituting this lower bound into the equation for Pr[dy = A,»] gives 


Pr [d, = A,,] < pA(d — D« — ay 


i {(N/2K)-(N/2J (K-1)1}/2 log {(J-1) (K=-1)] 


‘((K — 1)(2p,)]"7* 


- pala - ea} 


toe {or — noc — (7,7? tac — neem} 
pe Fg = DE 


We have assumed that po has been chosen such that pp > p,. For any 
L, probability that the initial propagation failure occurs at ¢ = Lr is 
bounded by 


where 


Pr [initial propagation failure occurs at ¢ = Lr] 
= 0 for L<m 
<J-N-Pr [db = A,,] for L=m 


since, by definition, a propagation failure cannot occur before ¢ = mr. 

We have chosen to number the memories according to their informa- 
tion storage capability, k. We can express this result in terms of k 
by noting that N = k/R andl — J/K S R S 1; therefore, 


Pr [initial propagation failure occurs at ¢ = Dr]. 


= 0 for L<m 


Spe, {| Z: seca lt for 2 
Ll Pe MOK. SI ST cate Ais 
. oe 


= O-0 


where 


| ee ee | - aoa" 
Tp agik oo LOR Oko: Il," 


Both C and £8 are functions of J, K and po. For any particular se- 
quence of memories, J, K and pp will all be constants. For example, if 
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J = 14, K = 15, and po = 10°, then 8 = 7.55; therefore, in this case, 
the probability that the first propagation failure occurs at t = Lr (for 
any L) is bounded by a function that decreases as the sixth power of 
the information storage capability of the memory. By choosing the 
information storage capability sufficiently large, this probability can 
be made arbitrarily small. 
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Reliable Computation in Computing 
Systems Designed from 
Unreliable Components* 


By MICHAEL G. TAYLOR 
(Manuscript received April 10, 1968) 


This is the second of two papers which present information-theory-type 
results pertaining to the reliability of computing systems designed from 
unreliable components. Two models for component malfunctions are con- 
sidered. The first 1s based on the assumption that malfunctions of a partic- 
ular component are statistically independent from one use to another. The 
second ts based on the assumption that components fail permanently but that 
the components which have failed are periodically replaced with good ones. 
In both cases, malfunctions in different components are assumed to be 
independent. Just as a channel capacity is defined for communication 
channels, a computing capacity is defined for computing systems. For both 
conyponent failure models, it is shown that there are computing systems, 
designed entirely from unreliable components of the assumed type, which 
have nonzero computing capacities. 


\ 


I. INTRODUCTION 


The objective of this paper is to show that it is possible for a com- 
puter, designed entirely from unreliable components, to perform op- 
erations reliably on information stored in stable memories. The con- 
cept of a stable memory is introduced in the paper preceding this, 
where it is shown that it is possible to store information reliably in 
memories constructed entirely from unreliable components. Two dif- 
ferent models for component malfunctions are considered. The first 
is based on the assumption that component malfunctions are statis- 
tically independent from one use of a particular component to an- 
other use. The second is based on the assumption that components 

* This work, which is based on part of a doctoral thesis submitted to the 
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fail permanently but that the components which have failed are 
periodically replaced with good ones. In both cases component mal- 
functions are assumed to be statistically independent from one com- 
ponent to another. For both component malfunction models it is 
shown that there are types of memories, called “stable memories,” 
that have a nonzero information storage capacity; that is, for cer- 
tain fixed values of the memory’s redundancy, the probability of a 
memory failure can be made arbitrarily small. A particular type of 
stable memory is considered in some detail. 

Since the operation of the computers to be described in this article 
is closely related to the operation of these stable memories, let us 
look briefly at these memories. They consist of several registers and 
a correcting network as shown in Fig. 7 of Ref. 1. The registers store 
information which is coded according to a low-density parity-check 
code? and the correcting network periodically monitors the contents 
of the registers, corrects errors and reinserts the corrected words into 
the registers. The correcting network is very similar to a low-density 
parity-check decoder. Such a decoder, if constructed from reliable com- 
ponents, decreases the probability of error for digits stored in the 
registers with each successive correcting cycle (iteration) provided 
that the initial probability of error is not too large. 

In order to understand the operation of a low-density parity-check 
corrector constructed from unreliable components, let us suppose that 
the initial probability or error for stored digits is somewhat higher 
than the probability of malfunction for any component in the correc- 
tor. For the first few interations the corrector decreases the prob- 
ability of error per digit almost as much as the reliable decoder does. 
However, eventually this probability becomes comparable to the 
probability that a new error is made by the correcting network itself. 
Thus an equilibrium probability of error per digit is established such 
that the probability of error per digit before and after each correct- 
ing cycle is the same. Figure 1 is typical graph of this probability. 

Although the probability of error per digit approaches an asymptotic 
value, it is still possible for a memory failure to occur. A memory failure 
occurs whenever the configuration of errors within the registers of the 
machine is such that a noiseless decoder would be unable to correct all 
the errors. The probability of a memory failure during the time interval 
0 st £&ris calculated in Ref. 1 and it is shown to have the same form 
for both component malfunction models considered. For any k and &, 
the probability of a memory failure in M, , a memory of the type under 
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consideration which can store k bits of information, is given by 
Pr{failure of M, during 0 < t S$ &7] < (@ +:1):C'-k”’ 


where C’ and #’ are constants for any particular type of memory, or 
to be more mathematically precise, for any particular sequence of 
memories, {/;}, where the members of the sequence are ordered ac- 
cording to their information storage capabilities. It also is shown that 
it is possible to make f’ > 0; in fact, a numerical example is presented 
where 8’ = 5.55. Therefore, for this example, it is possible to make 
the probability of a memory failure arbitrarily small by choosing k, 
the number of bits stored, sufficiently large. Furthermore, it is shown 
that increasing k does not increase the redundancy of the memories 
being considered, thus completing the proof of stability for these 
memories. 

Our objective is to show that it is possible to design a reliable com- 
puter which performs arithmetic operations on operands stored in 
stable memories and which presents the result in a form that can 
itself be stored in a stable memory. The latter condition assures that 
successive arithmetic operations can be performed by computers of 
this type. Just as stability is defined to be a property of sequences of 
memories, reliability will be defined to be a property of sequences of 
computing systems. 
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Fig. 1— The probability of error per digit. 
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II. DEFINITION OF RELIABILITY 


The computing systems we are considering consist of a number of 
stable memories for information storage and some logic circuits which 
perform the arithmetic operations. The operations are performed on 
operands which have been stored in stable memories and the results 
of the operations are stored in stable memories. Before it is possible 
to define “reliability,” it is necessary to establish a criterion for deter- 
mining whether a particular result is correct. We consider a result 
(coded) to be correct if a noiseless low-density parity-check decoder 
could correct all the errors; that is, if it could obtain the desired un- 
coded result. Thus, the class of correct results is precisely the decod- 
ing equivalence class which contains the code word whose informa- 
tion digits correspond to the desired uncoded result. 

Now consider a sequence of computing systems {S;} where, for 
any k, the computing system S; contains memories having an in- 
formation storage capability of k binary digits. We require that all 
the computing systems in {S;} be able to perform the same set of 
operations. The sequence {8;,} is called relzable if it satisfies the fol- 
lowing conditions: 


(z) For all k, the redundancy of S, must be less than a where a is 
a constant independent of k. The redundancy of a computing system 
is defined as the ratio of the complexity of the system to the amount 
of computation performed by the system.” 

(iz) For any 3 > 0 and 6 > 0, there must be a member of {8S;} for 
which the probability that the result of any sequence of 3 operations 
(within the allowed set) will be in error is less than 6. 

The reciprocal of the minimum redundancy for which a particular 
sequence of computing systems is reliable is called the computing 
capacity for these systems. 


III, OPERATION OF VECTOR ADDITION MODULO-2 


The operation of vector (bit-by-bit) addition modulo-2 is con- 
sidered first because low-density parity-check codes have the prop- 
erty that when this operation is performed on two code words, the 
result is always another code word. This is the only nontrival opera- 
tion that can be performed without using some elaborate procedure 
for generating the check digits required to form the coded result. For 


* Ref. 1 gives precise definitions of “complexity” and “amount of computation.” 
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this reason, the operation of modulo-2 addition is particularly easy 
to perform. 

The computing system to be considered consists of stable memories 
which store the operands and the results, and logic circuits which perform 
the arithmetic operation. It is assumed that all components within the 
computing system have a nonzero probability of malfunction. Let the 
stable memories containing the operands for one particular operation be 
denoted by M{ and M/’ . At t = T, the operation of vector addition 
modulo-2 is performed on the contents of these memories and the result 
is stored in another stable memory. It is assumed that all the memories 
within a particular computing system are physically identical. This 
means that all memories must have the same set of possible states; 
furthermore, since the states of a stable memory are divided into classes 
of equivalent states, the classes of states must be the same for all of these 
stable memories. Let us suppose that at t = T, the state of M{ belongs 
to C(I,;,), the class of states containing the code word I,;, , and that the 
state of 11%’ belongs to C(J,,;,). If the operation is performed correctly, 
the state of the stable memory containing the result will belong to 
C(Iet, B Lui). 

To show that computing systems of this type are reliable, we must 
show that a sequence of 3 operations can be performed with an overall 
probability of error that can be made arbitrarily small by choosing k 
sufficiently large while keeping the redundancy fixed. To compute the 
redundancy we evaluate the ratio of the complexity of the system to the 
amount of computation performed by the system. Each modulo-2 vector 
addition performed by the system involves three stable memories, each 
with an information storage capability of k, and a number of modulo-2 
adders equal to the number of information storage components in one 
stable memory. 

In Ref. 1 it was shown that the complexity of a stable memory is 
proportional to k; hence the complexity of these three memories and 
the associated modulo-2 adders must also be proportional to k. The 
amount of computation, that is, the number of two-input binary 
operations that an equivalent irredundant computer would perform, 
equals & since this irredundant computer would perform k additions 
(modulo-2) on its two k-bit operands. Since both the complexity of 
the systems being considered and the amount of computation that 
they perform are proportional to k, their ratio, the redundancy, is 
independent of k as required. 

Let us start by considering just one operation performed by the 
computing system. We assume that before the operation was performed 
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both Mj and M!/’ performed at least one correcting cycle on the stored 
operands. Thus, according to Fig. 1, the probability of error for digits 
stored in each of these memories is upper bounded by 7p, which, in 
general, is very small compared with the maximum allowable probability 
of error per digit. After the operation has been performed, the probability 
of error (e) for digits in the memory storing the result is upper bounded 
by 


Pridigit = «] < 2p, + p. + 2p, 


since any one of the following events could lead to an error in one 
particular digit: 


(t) The corresponding digit was in error in M{(probability < p,). 

(iz) The corresponding digit was in error in M/’(probability < p,). 

(iit) The adder that performed the operation on these digits made an 
error (probability denoted by p,). 

(iv) An error occurred in this particular digit position in the result 
memory between the time the result was stored and the end of the first 
correcting cycle performed on the result (probability S< 2p,). 

Provided that the resulting probability of error per digit is less 
than the maximum allowable value, successive iterations of the re- 
sult memory decrease this probability as shown in Fig. 1. After the 
result memory has performed one iteration, the contents of this 
memory can be used as an operand in a second operation. Successive 
operations can be performed provided that at least one correcting 
cycle is performed on each intermediate result. 

An error is made on one of these operations if the state of the 
result memory does not belong to the desired class of states, or 
equivalently, if a memory failure occurs within the result memory im- 
mediately following the operation. The method for computing the 
probability of such an error is almost identical to that used in Ref. 
1 to compute the probability of a memory failure. The result, which 
is obtained in the Appendix, is: 


Pr[any @ operation is performed incorrectly | all 


previous @ operations were performed correctly] 
ee ee : 88 
(1 — J/K)’ 


where C and £ are functions of the parameters of the code (J and K) 
and 0, the bound on the probability of error per digit. For any. par- 


< 
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ticular sequences of computing systems, J, K, and po, and hence C 
and 8, are all constants. This result applies for either of the component 
malfunction models discussed in the introduction. 

Finally we must bound the probability that an error occurs on any 
one of a sequence of 3 operations. It is assumed that neither a memory 
failure nor a propagation failure occurred in any stable memory between 
the time when an operand was originally stored and the time when the 
first operation was performed on that operand. We need not concern 
ourselves about the concept of a propagation failure except to notice 
that the bounds on the probability of a memory failure were actually 
derived by bounding the probability of either a memory failure or a 
propagation failure. Hence, imposing the condition of no propagation 
failure does not make the requirements any different from those which 
have already been used. The probability of an error during 3 operations 
is upper bounded by the sum of the probabilities that the initial error 
occurs on any one of these operations; that is, 


Prferror during a sequence of 3 @ operations | no 
memory failure or propagation failure in 


memories containing the original operands] 


Cc —~B+3 
< I ge . 
If J = 14, K = 15, and p. = 107° then 6 = 7.55; therefore, for this 
sequence of computing systems, the probability of an error in 3 vector 
modulo-2 additions can be made arbitrarily small by choosing k suffi- 
ciently large, thus providing that this sequence of computing systems 
is reliable. 


IV. GENERAL VECTOR OPERATIONS 


Consider a sequence of computing systems, {S;}, in which each sys- 
tem is capable of performing many different operations. The inputs to 
each of these systems are stored in stable memories, as before; how- 
ever, the inputs must specify not only the operands but also the 
desired operations. The digits stored in each memory are coded ac- 
cording to a low-density parity-check code, but now it is assumed 
that the code is in systematic form; that is, the information digits 
appear first in each code word. When an operation is performed on 
two code words, the desired result is the code word whose information 
digits are computed by performing the desired operation on the in- 
formation digits of the two operands. 


2346 THE BELL SYSTEM TECHNICAL JOURNAL, DECEMBER 1968 


The allowed operations for the computing systems under considera- 
tion consist of any vector (bit-by-bit) operation on the information 
digits of the operands. That is, the operation performed on a particular 
pair of information digits in the operands can be any one of the 16 
Boolean functions of two variables. Each of these operations is assigned 
a four-bit operation code. Since different operations can be performed on 
different pairs of information digits in the operands, the computing 
system S, , which contains memories having a storage capability of k 
bits, is able to perform (16)* different operations on any pair of operands. 
The desired operation is specified by means of four code words. For all 
0 <isk, the set of four digits in the 7th digit position in each of these 
four code words gives the operation code for the operation to be per- 
formed on the 2th information digit in each operand. For each pair of 
information digits in the operands, the computing system first selects 
the operation specified by the operation code and then performs this 
operation on the appropriate pair of digits in the operands. In this way 
all the information digits in the result are computed. 

Since the desired result is a code word, it is necessary for the 
computing system to generate the appropriate check digits to go with 
the desired information digits of the result. The operation of vector 
addition modulo-2 is particularly easy to perform on the information 
digits of two code words because the appropriate check digits can be 
generated by performing the operation of vector addition modulo-2 
on the check digits of the two operands. In general, it is more diffi- 
cult to generate the appropriate check digits. 

We consider first a method by which a noiseless system could generate 
the check digits, then show that by making a rather simple modification 
to this method, it is suitable for use in a noisy system. Finally we bound 
the probability that an error occurs in a sequence of 3 operations 
performed using this modified method. To simplify the terminology 
throughout this and subsequent discussions, we contrast noiseless and 
noisy systems. A noiseless system is one in which there are no component 
malfunctions whereas, for these discussions, a noisy system is any one 
in which the components have nonzero probabilities of malfunction. All 
results presented apply for either of the component malfunction models 
described in the introduction. 

Let us consider how a noiseless system consisting of memories, 
operation selectors, and computing devices might compute the desired 
result. Since there are 16 operations that could be performed on any 
pair of information digits in the operands, there must be 16 types of 
computing devices. One device of each type and one operation selector 
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is associated with each pair of information digits in the operands. Each 
operation selector decodes the operation code and selects the appropriate 
computing device. This device then performs the desired computation 
on one pair of information digits. In this way all the information digits 
in the desired result are computed. These resulting information digits 
are represented by the row vector v. 

The second step in computing the desired result is to compute the 
check digits. Let us assume that, within the computing system, k noise- 
less memories are used to store a set of k basis vectors for the code. This 
set of k basis vectors is represented by the generator matrix, G. It is 
assumed that the basis vectors have been chosen in such a way that G is 
in reduced-echelon form. (That is, G = [I,P] where I, isa k * k identity 
matrix and Pisa k X (N — k) matrix. The notation D = [AB] means 
that the matrix D can be partitioned into two submatrices, A and B.)° 
The desired result is obtained by performing the operation v © G 
where © represents the operation of matrix multiplication in which all 
additions are performed modulo-2. 

This matrix multiplication operation is performed in two steps. The 
first step consists of performing the bit-by-bit anp of the digits repre- 
sented v’ with the digits represented by each column of G (see Table I). 
The resulting row vectors, represented by the matrix G’, are stored in 
another set of k noiseless memories. A row of G’ is all zeros if the corre- 
sponding digit in v’ is 0 but it is identical to the same row of G if the 
corresponding digit in v” is 1. Therefore, the nonzero rows of G’ are the 
generators of the desired result. The final step in performing the matrix 
multiplication operation consists of adding modulo-2 the rows of G’. 

Let us see if a noisy computing system, containing stable memories 
rather than noiseless memories, can obtain the desired result in the 


TABLE [—AN EXAMPLE OF THE OPERATION V © G. 
v=f1 0 1 
100141 
G=/]0 1 0 1 0 
001 0 41. 
v? anped with columns of G 
100411 
={/0 000 0 
00%210i21 
A G@’ 


Vector sum modulo-2 of the rows of G’ 
1011 90 


ita 


voG 
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same way. The information digits of the result are computed by the 
method described above. This time, however, there is a nonzero proba- 
bility that any particular information digit is in error. There is now 
a problem when we try to use these digits to find the generators of 
the desired result. If there is any error in these information digits, 
the generators of the result will be specified incorrectly, in which 
case the resulting state of the stable result memory will almost cer- 
tainly be outside the desired class of states. Since there is no way to 
eliminate the errors in the information digits, the computing method 
as described is not suitable for use in a noisy computing system. 

The problem with the computing method that has just been described 
is that there is only one copy of the information digits of the result and 
there is no way to guarantee that this copy is error free. Suppose that 
we had many copies of these information digits such that the errors were 
statistically independent from one copy to another. We shall show later 
that it is actually possible to obtain copies with these properties. These 
copies of the desired information digits can be represented by means of 
a matrix V in which each column represents one of these copies. Each 
row of V is almost all zeros or almost all ones and the errors across any 
row of V are statistically independent. Each row of V that is almost all 
ones indicates that the corresponding row of the generator matrix is one 
of the generators of the desired result. Suppose that the generators of 
the code are stored in stable memories, the generator set. Since each 
stable memory containing k bits of information actually contains J-N 
binary digits (NV is the block length of the code and J is a parameter of 
the code described previously), the contents of these stable memories in 
the generator set can be represented by ak X JN matrix §. If we require 
that V be ak X JN matrix too (that is, that we have JN copies of the 
information digits), an operation can be performed on the memories 
equivalent to ANDing corresponding entries in the matrices G and V, the 
result being represented by a k X JN matrix §’. This operation leaves 
the desired generators essentially unchanged but it replaces the unde- 
sired generators with vectors which are equivalent to the zero vector. 

In this case an error in one copy of the information digits causes at 
most one error in the digits represented by §’. This is much better than 
the result of the previous method where one error in the information 
digits causes errors in an entire row of G’. If the probability of error per 
digit in §’ is not too high, it would be hoped that each stable memory 
corresponding to a row of §’ would be able to reduce this probability of 
error by performing one correcting cycle. The operation of vector addi- 
tion modulo-2 could then be performed to obtain the desired result. Since 
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this method looks promising, it will be considered in more detail. The 
first step is to make sure that it is possible to obtain copies of the infor- 
mation digits with the desired properties. Then a sequence of computing 
systems which perform operations by this method is analyzed to deter- 
mine if the sequence can be made reliable. 

When the computing system is ready to operate on a particular 
operand it copies the operand JN times and each copy is stored in a 
stable memory. A set of JN stable memories corresponding to one 
operand is called an operand set (see Fig. 2). An operand matrix, 
A; 7 = 1, 2, is defined in the following way. Each column of A; is assoc- 
iated with one stable memory in the corresponding operand set. The 
digits in a particular column of A; equal the digits in the first noisy 
register within the stable memory corresponding to that column of A; . 
Recall that each register within a stable memory contains a noisy approx- 
imation to the same code word. Thus the digits in each row of an operand 
matrix are almost all ones or almost all zeros. The digits different from 
the dominant one in each row correspond to errors in different stable 
memories within the operand set. Let each operand memory perform m 
(m = the number of independent iterations’) iterations on the digits 
contained in it. If no memory failure or propagation failure occurs in 
any stable memory within an operand set, the errors in the digits stored 
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Fig. 2— The generator set and typical operand sets. The digits are typical of 
the digits in each of the J noisy registers within the stable memory. 
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within a stable memory depend only on the component errors that 
occurred in that memory during the last m iterations,’ and these com- 
ponent errors are assumed to be statistically independent from one 
stable memory to another. Since the contents of these stable memories 
are represented by the columns of the operand matrix A, , the errors in 
one coJumn of this matrix are statistically independent of the errors in 
any other column. Therefore, after m iterations, the errors across any 
row of the operand matrix are statistically independent. 

The four stable memories which contain the code words that specify 
the operation code are called the “operation code memories.” The 
contents of each of these memories are copied JN times to form an 
operation code set (see Fig. 3). Each memory in each operation code set 
performs m iterations just as the memories in each operand set do. An 
operation code matrix 0; ,0 < 72 S 4, is defined for each operation code 
set; the definition is analogous to the definition of each operand matrix 
as stated previously. 

Let us consider a particular digit position in each of the operation 
code matrices and each of the operand matrices. The four digits in this 
position in the four operation code matrices form the operation code for 
the operation to be performed on the two digits in this position in the 
two operand matrices. After each memory in each set of memories has 
performed m iterations, the operations specified by the operation code 
matrices are performed. The digits which are the results of these opera- 
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Fig. 3— The operation code sets. 
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tions can be represented by k X JN matrix, each column of this matrix 
being an approximation to the information digits of the desired result. 

Since the errors in each row of each operand matrix and each opera- 
tion code matrix are statistically independent, and since the errors made 
by the components which perform the operations are statistically inde- 
pendent, the errors along any row of this resulting matrix must be 
statistically independent. Therefore, this is precisely the matrix V 
described previously. The digits represented by this matrix are anped 
with the digits represented by § to form the matrix §’. The operation 
of vector addition modulo-2 is then performed on the digits in the regis- 
ters represented by the rows of the matrix §’ to obtain the desired 
result (see Fig. 4). 

Finally we must compute the probability that all of the operations 
are performed in such a way that the state of the result memory belongs 
to the desired class of states. The first step required that m iterations be 
performed by each memory in the operand sets and the operation code 
sets. There are a total of 6-J-N stable memories in these sets. The 
memories represented by the generator matrix § must also perform m 
iterations. There are k memories in this generator set. It is assumed that 
neither a memory failure nor a propagation failure occurred in any 
stable memory before the m iterations were started. We wish to bound 
the probability that a memory failure or a propagation failure occurs in 
any memory during these m iterations. 

In Ref. 1 the probability that either a memory failure or a propagation 
failure occurs in any one memory on any particular iteration, given that 
no memory failure or propagation failure occurred previously, was upper 
bounded. This bound equals 


Pr[memory failure or propagation failure | no 
previous memory failure or propagation failure] 
C —B+2 
cy a 


where C and 8 depend on the parameters of the code and the proba- 
bilities of component errors but not on k. This bound has the same 
form for either of the two component malfunction models discussed 
in the introduction. 


Since 


k k 
a ee Ss se ee ~ 29 
Ns C20) and m < log |; 7k | for k > 2, 
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Fig. 4— (a) The result set corresponding to matrix G (illustrates the vector anp of 
the two operands). (b) The addition operation. 


the probability that a memory failure or a propagation failure occurs 
during the m iterations is bounded by 
Pr[memory failure or propagation failure during m iterations] 


< (|x| ae t} {Jog Fede yb? 


fork > 2. 

The operation is performed following these m iterations. The result of 
this operation should be the generators of the desired result, each 
generator being stored in a stable memory. These memories are called 
the result set and are represented by the matrix §’. Let us suppose that 
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no memory failure or propagation failure occurred during the m itera- 
tions. We now bound the probability that a digit within a memory in the 
result set is in error just before the time when the memory has completed 
the first correcting cycle on the newly inserted digits. If one of these 
digits is in error, at least one of the following events must have occurred: 


(1) The corresponding digit was in error in either one of the operand 
sets. The probability of this event is bounded by 2p, (sce Fig. 1). 

(it) The corresponding digit was in error in the generator set. The 
probability of this event is bounded by 74. 

(w2) An error was made in determining the operation to be per- 
formed. This probability is denoted by Peontrot. 

(iv) An error was made in performing the desired operation. This 
probability is denoted by Poperation- 

(v) An error was made in performing the anv operation. This 
probability is denoted by panp. 

(vz) An error occurred within the memory in the result set. The 
probability of this event is bounded by 2p,. 


Therefore, by the union bound 
Prierror in any digit within the registers in the result set] 


< 3p1 i Peontrol ae Poperation a Panp za 2p, 


and we require that this sum of probabilities be less than po. In order 
to show that it is possible to satisfy this inequality, let us consider 
a numerical example where 


_ = oa =_ oo po 197?° 
Peontrol ae Poperation _ Panp = Pa = Pr — Pa Fe. ] 


po = 107°, J = 14and K = 15. 
For this example p; ~ 2 X 10-?°, therefore 
3p = Peontrol i Poperation = PAND -+ 2p, wd) 1.1 >< 10°° < 107° — Po 


showing that the condition on 7p is satisfied. 

The probability that a memory failure or a propagation failure 
occurs in any one of the memories in the result set can be computed 
by exactly the same method used in the previous section to compute 
the probability of a memory failure in the stable result memory fol- 
lowing the operation of vector addition modulo-2. In fact the result 
is exactly the same as that obtained in the previous section since 
introducing the bound py makes all the relevant probabilities iden- 
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tical. The result is as follows 
Pr[memory failure or propagation failure in one memory in result set] 


_ Cees 
[1 — J/K]’ 
Therefore, the probability that any memory failure or propagation 
failure occurs anywhere in the result set is bounded by 


< 


Pr[memory failure or propagation failure in result set] 


¢ . —B+4 
sie Bee 77.6 eae 


The final step consists of performing the operation of vector addi- 
tion modulo-2 on the contents of the memories in the result set. The 
adder network, shown in Fig. 4, performs k—1 vector additions mod- 
ulo-2. According to the results of the previous section, the proba- 


bility that any one of these operations is performed incorrectly is 
bounded by 


Prferror in any of k — 1 @ operations] 


C ~B4+3 
< (k — 1) fi — a7KpP® : 
C ~B+4 

< i= yk = T/KE © sie 


The result is in error only if a memory failure or propagation failure 
occurs during the first m iteration, during the computation, or during 
the vector additions modulo-2. Therefore, the probability that the 
result is in error is bounded by 


Prfresult is in error] 


cern] + e=Snw mrt 


are 


fi — J/KE for k>2 


+ 


4 
= FP seit ° 


Finally, we must bound the probability that an error occurs during a 
sequence of 3 operations. This probability is bounded by the sum of the 
probabilities that an error occurs on any one of the 3 operations. There- 
fore, 
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Prf{error during a sequence of 3 operations] < 3-Presut - 


The dominant term in this bound is proportional to k~°** where @ is a 


constant for any particular sequence of computing systems. Ref. 1 gives 
a numerical example where 8 equals 7.55. For this example, the prob- 
ability that an error occurs in a sequence of 3 operations is bounded by 
a function which decreases as k~*. Therefore, for the sequence of com- 
puting systems for which 68 = 7.55, the probability of an error during 
3 operations can be made arbitrarily small by making k sufficiently large. 

Thus far we have not considered the complexity of these computing 
systems. The “basic processor,” that is, the machine that performs one 
operation on each of the k pairs of digits has a complexity which is 
proportional to k’. Let us suppose that the system S, is capable of 
performing a sequence of 3 operations on each of the k digits. In general, 
this requires 3 basic processors. Thus the complexity of the system is 
proportional to 3-k*. The amount of computation, the number of 
operations performed on pairs of digits, is equal to 3-k. Thus, for S, , 
the ratio of the amount of computation to the complexity, is proportional 
to k™*. The computing capacity equals the maximum value of this ratio 
for which the probability of error can be made arbitrarily small. Since 
the probability of error for S; is proportional to 3-k~°**, this probability 
of error can be made arbitrarily small only in the limit as k approaches 
infinity; but in this limit the ratio described above approaches zero. 
Thus the computing capacity for these systems equals zero. 


V. ARITHMETIC OPERATIONS ON OPERANDS OF BOUNDED MAGNITUDE 


The systems described in the previous section have two major 
shortcomings. The first is that the computing capacity equals zero 
and the second is that the systems are restricted to performing op- 
erations on corresponding bits in the operands. Let us consider the 
first of these shortcomings. We would like to show that it is possible 
to modify these systems in such a way that the amount of computa- 
tion per component is independent of k, whereas the probability of 
error decreases with increasing k. In order to obtain such a result, 
we must reuse the basic processor. This means that we must be able 
to “program” the computing system. We have already shown that 
it is possible to “program” the basic processor to perform different 
operations. We must now show that it is possible to store the program 
and the operands in memories which can be located when the contents 
are needed. 

In the previous section we described a method for locating desired 
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generators which were stored within the generator set. The jth genera- 
tor was found by first setting up an “address set” consisting of J-N 
stable memories each of which has a one in the jth information symbol 
position and zeros in all other information symbol positions. After 
all stable memories had performed m iterations, the generators were 
ANDed with the address set and the result was propagated through 
a modulo-2 addition network as shown in Fig. 4. We can store the 
program and operands in a “program set’ consisting of & memories 
and use precisely the same method to locate a memory in this set. 
Thus, the “address” of the jth memory in the program set is a 1 in 
the jth information position of a code word. 

In order to keep track of the next operation to be performed, we 
need to use one memory as an “instruction counter.” Initially this 
memory contains a code word with a 1 in the first information posi- 
tion and a zero in all other information positions. After each opera- 
tion, we shift the information digits one position to the right. A 
simple modification of the basic processor allows it to perform this 
shift operation. In order to specify whether a shift is desired we add — 
one more operation-code set. The information digits in the memories 
within this additional set are all 1’s if a shift is desired and all 0’s 
otherwise. (Notice that we are really wasting (k—1) of the informa- 
tion digits in each memory in this additional set since only one digit 
is required to specify whether a shift is desired). 

The processor checks the “shift bit” before performing any opera- 
tion to see if a shift is desired. If the shift bit equals 1, the shift is 
performed on the first operand and no other operations are performed. 
If the shift bit is 0, the operations specified by the other operation code 
sets are performed. In either case, the result is computed as before 
by adding the appropriate generators. 

The second major shortcoming of the systems described in the 
previous section is that only operations on corresponding bits in the 
operands can be performed. In order to perform more general opera- 
tions we might consider permuting the digits in one of the operands 
before the operation is performed. Unfortunately, the probability that 
a particular digit is permuted incorrectly depends on k and, in fact, 
approaches 1/2 as k approaches infinity. This problem arises because 
we have attempted to perform one operation that involves all k 
information digits, but it can be avoided by dividing the k digits into 
a number of segments where the number of digits in each segment 
does not depend on k; that is, the number of segments must grow 
with k. We can then treat each segment as a separate operand. This 
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means that we must restrict our attention to operations performed 
on operands of bounded magnitude, which is certainly not a severe 
restriction. 

Let us consider the digits in one particular segment. If it were 
possible to permute these digits before each operation, we could 
compute the sum or product of the digits in this segment or, in fact, 
any finite sequence of arithmetic operations on these digits by per- 
forming a sequence of bit-by-bit operations on the appropriately 
permuted digits. Consider the additional modifications that must be 
made to the basic processor in order to allow it to perform these 
permutations. For the purpose of specifying the desired permutation 
we must again increase the number of operation-code sets. If the 
longest segment contains s digits, the permutation can be described 
by the contents of the memories in log s additional operation-code 
sets. The permutation of a particular segment of digits is specified 
by the corresponding segments of the memories in these additional 
operation-code sets. (Notice that a permutation of s digits can be 
described by s - log s digits.) The operation is performed according 
to the method described previously. However, in this case the infor- 
mation bits within each segment of one operand are permuted just 
before the actual operation is performed. 

Let us consider how these modifications affect the bound on the 
probability of error for the basic processor. This bound is given in 
Section IV. The first term bounds the probability that either a mem- 
ory failure or a propagation failure occurs anywhere within the 
operand sets, the operation code sets, or the generator set during the 
first m iterations. Since there are now [log s] + 1 additional operation- 
code sets, the coefficient of this first term must be changed from 


J 7% lf 
«(; = Te) ee | " l« are at; . Tk) a i], 
However, the dependence on k of the first term is unchanged. 

The second term bounds the probability that a memory failure 
occurs either in the result set or in one of the memories required for 
the modulo-2 addition operation. In deriving this second term, it 
was necessary to bound the probability of error for digits in the 
result set. We considered a set of events at least one of which must 
have occurred if a particular digit is in error. There are now two 
additional events which could lead to such an error. The first is that 
the shift instruction was interpreted incorrectly and the second is 
that the permutation operation was performed incorrectly. In both 
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cases, the probability of error per digit is independent of k. Therefore, 
it is possible to find examples where py bounds the probability of 
error for digits in the result set. 

For example, consider the numerical values presented in Section 
IV. If the probability of a permutation error equals 10-?° and the 
probability of a shift error also equals 10-*°, then pp = 10°? is still 
a bound on the probability of error per digit. Thus the second term 
is not changed by the system modification. Therefore the dominant 
term in this probability of error is still proportional to k-&+4*, The 
equipment required to select the appropriate memory from the program 
set is similar to a basic processor. Therefore, the probability of mak- 
ing an error in one selection operation is of the same form as the 
probability of error for the basic processor. In particular the dominant 
term in this probability of a selection error is proportional to k-ft+*, 

Finally, let us consider the number of operations that can be per- 
formed by this modified computing system. We have allowed k mem- 
ories for storing the program and the operands. This means that the 
number of steps in the computation can be at least proportional to 
k where each step consists of one set of k operations performed by 
the basic processor. Therefore the amount of computation can be 
proportional to k?. Furthermore, the complexity of the system is also 
proportional to k? since only one basic processor and one operation 
selector is needed. Therefore, the amount of computation per com- 
ponent is independent of k and in general is nonzero. 

The probabibity of an error during the k steps in the computation 
is upper bounded by k times the probability that an error occurs 
during any one step. Therefore, this overall probability of error is 
proportional to k-&+°, Since we have already presented an example 
where 6 = 7.55, in this case the probability of error can be made 
arbitrarily small by making & sufficiently large while keeping the 
redundancy fixed. This shows that the computing capacity for sys- 
tems of this type is nonzero. 


VI. CONCLUSIONS 


There are basic processors, designed entirely from unreliable com- 
ponents, which can perform arbitrary binary operations on the corre- 
sponding information bit of two operands stored in stable memories. 
These information bits can be shifted, by one bit, or permuted, within 
segments of bounded length, before performing the operation. The 
probability of error for the basic processor can be made to vanish 
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as k-8+* where k is the number of information bits in each operand 
and @ is a function of the component malfunction probabilities and 
the information rate of the code used in storing information in the 
memories. 

A program, including the necessary operands, can be stored in k 
stable memories and the operations specified by the program can be 
performed in sequence by a single basic processor. It is possible to 
perform arbitrary k-step computations on numbers of bounded mag- 
nitude. Furthermore, the number of these computations that can be 
performed simultaneously is proportional to k. The complexity of the 
equipment required to perform the computations and the amount of 
computation are both proportional to k?. Thus the amount of com- 
putation per component and hence the computing capacity is nonzero. 

These results apply to systems designed from either of two types 
of unreliable components. The first type is one which malfunctions 
because it is perturbed by random noise in the system. The malfunc- 
tions of these components are modeled mathematically by assuming 
that they are statistically independent from one component to another 
and from one use of a particular component to another use. The 
second type of component is one which fails permanently; however, 
it. is assumed that components which have failed are regularly re- 
placed by good ones. In this case the mathematical model is based 
on the assumptions that components fail independently of each other 
and that there is an upper bound, of value less than 1%, on the proba- 
bility of malfunction of any particular component on any single use. 

All the results are existence proofs. Therefore, the criterion for 
choosing the particular systems to be considered was simplicity of 
analysis rather than efficiency of operation. Furthermore, the emphasis 
was not placed on obtaining the tightest possible bounds but rather 
on obtaining simple bounds that were sufficient to prove the desired 
results. In particular, one would hope that the probability of error 
would decrease exponentially with k rather than algebraically with 
k. However, the particular techniques used here do not lead to such 
a bound. 

There are many questions still to be considered concerning the 
design of practical systems constructed from unreliable components. 
There are also many theoretical questions concerning the derivation 
of bounds which are tighter than those derived here. It is hoped that 
the development of practical, reliable computing systems will follow 
the presentation of existence theorems like those presented here just 
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as the development of practical, reliable communication systems 
followed the presentation of the existence theorems of information 
theory. 
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APPENDIX 


A Bound on the Probability of Error 


In this appendix we upper bound the probability of error for one 
vector addition modulo-2. The method for deriving this bound is 
based on the one used in Ref. 1 to upper bound the probability of a 
memory failure. Since the method is long and rather involved, it is 
‘not reviewed here. Please see the proof in Ref. 1, because here we dis- 
cuss only those places where the two proofs differ. Both the terminol- 
ogy and the notation here is the same as in Ref. 1. 

The computing systems being considered are described in Section 
III. It is assumed that the operation was performed at t = T. The 
operation takes ¢,, seconds to be performed. Some time during the 
7 seconds following ¢ = T + tf), the stable result memory starts to 
perform the first correcting cycle on the newly inserted digits. We 
denote the time at which this correcting cycle starts by = T+ oc 
(see Fig. 5). 

It is assumed that neither a memory failure nor a propagation failure 
has occurred in either operand memory up to ¢ = J. A bound has 
already been derived on the probability of error per digit in the 
result memory. We must now bound the probability of a propagation 
failure occurring in the result memory. This involves relating the A 
configurations for the result memory to the A configurations for the 
two operand memories. Finally, the probability of a propagation 
failure will be used to obtain the desired bound on the probability of 
making an error in performing the operation. 
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Fig. 5 — Sequence of events pertaining to @ operation. 


To relate the A configurations for the result memory to those for the 
two operand memories, let us consider the type 7(¢ > 0) A configuration 
for the result memory evaluated at ¢ = T + o. If there is a 1 in the entry 
corresponding to the digit dy , the controlling errors must have changed 
the value of dy in the result memory; but this change can occur only if 
the controlling errors changed the value of one but not both digits in 
position d,) in Mi and MM’. If both these digits in position d, had been 
changed, the two changes would cancel each other when the operation 
of vector addition modulo-2 was performed. Therefore, the type 7(z > 0) 
A configuration for the result memory evaluated at ¢ = T + a is equal 
to the vector sum modulo-2 of the type 7 A configurations for M{ and 
Mi’ where the A configuration for M{ (or M?/’) is evaluated at the end 
of the last correcting cycle performed before t = T (that is, att = T — ¢’ 
ort = T — t”). 

The type 0 A configuration evaluated at ¢ = T + o is computed in 
a different way. A 1 in this type 0 A configuration represents a new 
error in the stable result memory at t = T + oc. To compute the con- 
figuration of new errors, we imagine that the computing system was 
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noiseless up to the beginning of the most recent correcting cycle per- 
formed on the digits stored in the result memory att = T + c. In this 
case, the ‘‘most recent correcting cycle” consists of the last correcting 
cycle performed by M{ before ¢ = T and the last correcting cycle 
performed by Mj’ before t = T (see Fig. 5). 

Since the probability of error per digit att = T ++ o can be bounded 
by po , the probability of a new error at i = T + o can also be bounded 
by po. Therefore the probability of a 1 in the type 0 A configuration 
evaluated at ¢ = 7 + o must be less than py . Since the type z(7 > 0) 
A configuration for the result memory evaluated at ¢ = T + ca is the 
modulo-2 sum of the type 7 A configurations for M@{ and M?’, the 
probability of a 1 in the type 7 A configuration evaluated att = T +o 
is bounded by twice the probability of a 1 in the type 7 A configuration 
for M; (or M?/’) evaluated att = T — Vv (ort = T — 0’). 

The method for computing a bound on the probability of a memory 
failure at t = T + o is exactly the same as that used to compute a 
bound on the probability of a memory failure at any other time for 
any stable memory. This method consists of bounding the probability 
that a noiseless correcting network could correct all errors present at 

= JT + o by performing m iterations. This probability is bounded 
by the probability that the initial propagation failure occurs at t = 
T +c or during the m noiseless iterations performed after ¢ = T + a. 

The initial propagation failure occurs whenever there are one or more 
1’s in the type m A configuration. Since, by assumption, no propagation 
failure occurred in either M/ or M/’ before t = T, the type m A con- 
figuration for M{ and M/’ must be all zero for allt < T. The type m A 
configuration for the result memory evaluated at ¢ = JT + a is the vector 
sum modulo-2 of two of these type m A configurations for Mi and M/’. 
Therefore, the type m A configuration evaluated at = T + o must be 
all zero. Hence, no propagation failure can occur att = JT + oc. 

A bound is derived in Ref. 1 on the probability of a 1 in the entry 
corresponding to the digit dy) in the type m A configuration evaluated 
at ¢ = Lr. The equation used to compute this bound is; 


Pr{i in one particular entry in type m A configuration 


evaluated after Lth iteration] 


< (J - 1K — 0( heal — 1p)" 


-Pr{i in one particular entry in type m — 1 A configuration 


evaluated after (Z — 1)th iteration] (1) 
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where po is both a bound on the probability of error for digits in the 
registers of MM, and a bound on the probability of a 1 in the type 0 A 
configuration. To compute a bound on the probability that the initial 
propagation failure occurs at ¢ = J’ + o + 7, we apply this recurrence 
relation. By applying it m — 1 times, we obtain .the probability of a 1 in 
the type (m — 1) A configuration for M/ evaluated att = T — ?, 
namely 


Pr[l in one particular entry in type (m — 1) A configuration evaluated 
att = T — v’) 


< p(t — IK —- ere ? UK - yep)” 


4 
a) eee 


The probability of a 1 in the type (m — 1) A configuration evaluated 
at t = T + o is bounded by 2 - P,,_1. One final application of the 
recurrence relation leads to 


Pr[l in one particular entry in type m A configuration evaluated at 


=T+o+7] 


< apdt = IK = v7 J/2 — ? UK - Dep}, 


Gallager has shown that there are low-density parity-check codes? 


log (J — 1)(K — 1) 
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where 


log {iu - no - (75? tax - Derr} 
Ogee Ne NY 
aa Ploe = ae 1) 

The probability that the initial propagation failure occurs at t = 
Tl’ + o + 7 is bounded by the probability of one or more 1’s in the 
type m A configuration evaluated at t = T + o + +. Since there are 
J-N entries in this A configuration, and since N < k/(1 — J/K), by 
the union bound, 


Pr{initial propagation failure occurs at = JT + o + 7] 


<8) ema - are 


~ 2Ck*?, 


This same bound applies to the probability that the initial propaga- 
tion failure occurs on any one of the m iterations performed after 
t=T+ 4. 

A memory failure can occur at £ = T + o only if the initial propa- 
gation failure occurs on one of the m iterations performed after t = 
T + o. Therefore, by the union bound, 


Pr[memory failure at i = T + o|no memory failure or 
propagation failure for t < T] 
= Pr[@ operation is performed incorrectly] 
< 2-C-m-kP*! 


< 2-C:log Ee Ee 


C —B+2 
<2 Teak k : 

To show that the computing systems being considered are reliable, we 
must show that it is possible to perform a sequence of 3 operations with 
an overall probability of error that can be made arbitrarily small by 
choosing k sufficiently large. After one iteration of the stable result 
memory (that is fort = 7 +o +7), the result stored in this memory can 
be used as an operand for another vector addition modulo-2. Let us 
suppose that another addition operation is performed at ¢ = T +o + 
7 = T’ and that the first correcting cycle on this second result starts at 
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i = T’ + o’. We now bound the probability that this second operation 
is performed incorrectly given that the first operation was performed 
correctly (that is, given that no memory failure or propagation failure 
occurred at = JT’ + oc). The method for deriving this bound is identical 
to the one used above. We bound the probability that the initial propa- 
gation failure occurs at ¢ = T’ + o’ or on any one of the next m itera- 
tions. In this case, the probability that the initial propagation failure 
occurs at t = J’ + o’ is not zero since a propagation failure could have 
occurred att = T”. 

Equation I relates the probability of a 1 in the type 2 A configura- 
tion evaluated after one particular correcting cycle, to the probability 
of a 1 in the type (2 — 1) A configuration evaluated after the pre- 
vious correcting cycle. The application of this equation is simple 
except in cases where an addition operation has been performed 
between successive correcting cycles. It has been shown already that 
in each case where an addition operation has been performed, we 
must double the probabilities that otherwise would have been sub- 
stituted into the equation. Since, in this case, there were two addi- 
tion operations performed within the m correcting cycles of interest, 
the value of this bound must be twice the value of the bound derived 
above. 

Therefore, 


Pr{second @ operation is performed incorrectly | first @ 
operation was performed correctly] 


< 2?-[m + 1]-C-K-*™ 


<2 o-tog |e k 


< 2? : 7 TE : 7B +? 


where we have used the bound 


k 
1—J/K k | 
mi <p te +1 < ts | ag for k > 2. 


This result can be extended to any number of vector additions 
modulo-2 performed in series. However, since only m iterations are 
considered in deriving this bound, the largest value that this bound 
can have corresponds to the case where an addition operation has 
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been performed between each iteration for the last m iterations. For 
this “worst case” the bound on the probability that the mth vector 
addition modulo-2 is performed incorrectly, given that all previous 
additions were performed correctly, is 


Pr[any @ operation is performed incorrectly | all previous 


@ operations were performed correctly] 


ZOE « log Fea i 


C k —B+2 
Sin E = ve 


C B43 

< it 7K * R 
In this Appendix we have not made any reference to the underlying 
assumptions concerning the models for component malfunctions. We 
are interested in two models as explained in the introduction. Since 
equation 1 applies for either of these models and since the mathe- 
matical development based on equation 1 also applies for either 
model, the final result, namely the probability of the initial modulo-2 
operation error, has the form given above for either component mal- 

function model. 
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A Self-Healing Control 


By BRUCE E. BRILEY 
(Manuscript received April 30, 1968) 


This article describes an electronic computer self-repair technique which 
does not require system duplication. It identifies the properties demanded of 
control hardware to effect this end and considers their practicability. It also 
decribes a prototype computer constructed to test the feasibility of this form 
of self-repair, and discusses test results. 


I. INTRODUCTION 


In electronic computing machines, where a high degree of availabil- 
ity is a requisite, some form of self-repair is provided. The rational in 
most such machines is that a given piece of hardware must be backed 
up by an identical part which may be switched in upon failure of the 
first. 

Work in nonreplicative self-repair has chiefly centered about the 
arithmetic unit and other controlled entities where the solutions, 
though not trivial, are relatively straightforward. The control portion, 
however, has consistently been avoided as unmanageable short of 
replication.t This paper describes a technique for what will be called 
“self-healing” the control of a machine which differs radically from 
conventional self-repair, and displays potential for considerable eco- 
nomic savings. 


II. DEFINITIONS 


Terms used in describing the technique and their definitions are: 

Order denotes that portion of an instruction conventionally used to 
identify the instruction type, not including the address and other 
fields of the instruction, even when used for augmentation rather than 
addressing. 

An order structure is called closed (closure under imitation) if to 
each order there corresponds at least one program of finite length 
using only a subset of the order structure diminished by the object 
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order such that the program imitates the salient actions of the object 
order in every detail except timing. 

A control is called failure autonomous if the circuitry associated 
with the sequencing of an order is unique to it and does not propagate 
the effects of an internal failure beyond the associated circuitry’s 
geographic bounds. 

A control unit is called self-diagnosing if, under failure, (1) the con- 
trol immediately ceases activity, (i) the identity of the offending 
order is immediately available, and (wz) the control is failure au- 
tonomous. 

A control unit is called entropic if the circuitry associated with each 
order assumes a state under failure such that any subsequent attempt 
to execute that order will cause a summary hang-up without any of 
the order’s control signals having become active. 

Self-diagnosing means immediate “incidental” diagnosis as a basic 
property, as opposed to “self-diagnosable,” which means lending itself 
to easy but finite diagnostic processing, and contains the former term. 

The fact that all order structures are not closed may not be ap- 
parent, but examination of the repertoire of a typical computer will 
bring to light orders which violate the definition. 

Proof of the existence of a nonclosed order structure capable of all 
the conventional operations requires only observation of the effect 
of a failure upon Van der Poel’s limiting case machine.? (Van der 
Poel shows that a computer with only a specialized subtract instruc- 
tion can perform all essential operations.) This machine is left with 
the empty set upon diminishing its repertoire by one. However, exist- 
ence in the repertoire of any order which performs a unique function 
such as setting a flip-flop accessible to no other order is sufficient to 
render the structure, of which it is a member, outside the closure 
definition. 

A seemingly trivial example of a closed order structure is the case 
of a repertoire with each instruction duplicated; the imitating routine 
for a given instruction, in this case, is of length one. This extreme 
appears to be, at best, equivalent to conventional duplication ap- 
proaches, and certainly worse than any other self-healing case. How- 
ever, H. Y. Chang has pointed out that no conventional checking 
circuits would be necessary in the control and imitation would be per- 
formed with no reduction in efficiency. This form of duplication, then, 
may be viewed as the boundary between self-healing and conventional 
self-repair, with some of the advantages of both. 
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II. BACKGROUND 


The definitions are chosen advisedly because a control which is self- 
repairing in the sense described must be the realization of an order 
structure which is closed, and must be entropic and self-diagnosing 
(which implies failure autonomy). 

The practical possibility of building a self-healing control came to 
light as the result of study of a (picoprogramming) control with self- 
diagnosing properties.* 

Self-diagnosing is a required attribute because (i) the identity of 
the offending order must be immediately available without processing 
(the control cannot be trusted to perform diagnostic processing), 
(22) control activity must cease until remedial action is taken or insane 
processing may be performed, and (2) failure autonomy must be 
realized or several orders may be rendered imperfect by a single failure. 

Entropic behavior is required so that (2) the repairing entity may 
be called automatically into play each time the defunct order is to be 
executed and (iz) the defunct order’s circuitry will remain inactive 
and not interfere with the repairing entity’s action. 

The order structure realized must be closed to permit the technique 
to function; this requirement is central to the technique’s operation. 


IV. PRACTICABILITY 


These requirements may seem artificial; indeed, unattainable. How- 
ever, the self-diagnosing control has the very properties demanded 
above, so that the only otherwise artificial requirement to meet is that 
of order-structure closure. 

We conjecture that the addition of about 10 percent to the repertoire 
of a typical machine would effect closure. Recalcitrant instructions, 
the length of whose imitating routines threaten to increase without 
bound, can be handled by the compromise of duplicating those instruc- 
tions only. Their imitating routines then shrink to one instruction in 
length. (A machine designed specifically to be self-healing would, of 
course, try to avoid such instructions.) 


Vv. THE TECHNIQUE 


The technique is relatively simple. An inductively coupled detec- 
tion lead monitoring all order circuits observes a continuous sequence 
of pulses during normal operation. However, because of the second 


* See the Appendix and Refs. 3 through 6. 
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property of the self-diagnosing feature, when the circuitry associated 
with an order fails, the monitoring lead will notice a cessation of ac- 
tivity. A time-out will take place, and the machine will be placed in 
the remedial mode. 

In the remedial mode, the entire (failed) instruction is stored, then 
control is transferred to a location in memory whose address is gen- 
erated from the order field of the instruction, and normal operation 
is resumed. The transferred-to location in memory contains the first 
word of a routine which imitates the action of the failed order using 
only other orders. This is possible because of the closed order structure. 

The transition from imitation to a continuation of the program 
being run is smooth, requiring no intervention. The next time execu- 
tion of the failed instruction is called for the machine hangs up again, 
without any control pulses being generated, because of the control’s 
entropic character. The same procedure as before is then followed. 

A simple example is logical left shift (shift the contents of the ac- 
cumulator logically one binary place to the left), because its imitating 
routine is brief. Figure 1 shows the effective mechanization of self- 
healing for the shift left instruction. The action of sensing the health 
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Fig. 1— Example of self-healing. 
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of an instruction is indicated by the test preceding each instruction; 
actually, the test takes place throughout the duration of an instruc- 
tion’s execution, and is much less specific than the illustration im- 
plies, asking only the general question, “Is the machine still run- 
ning?” rather than, “Is shift left operative?” 

The net result is that the machine is, in effect, healed; software is 
used as a patch over the “crack” in the hardware. Execution of the 
failed instruction will be, in general, very inefficient, but if the time- 
out is short, the overall speed of the machine generally will be only 
slightly affected. 

If fast memory is considered too expensive or limited in size to 
contain all of the imitation routines, they may reside in auxiliary 
storage, and the one selected under failure would be called into a 
standard block in fast memory. Subsequent uses of the defunct order 
would execute its imitation routine out of this block. Such “calling” 
would require more elaborate remedial hardware. 


VI. REALIZATION 


The picopragrammed control lends itself to self-healing because of 
certain properties peculiar to its implementation: 


(t) The circuitry implementing each order is segregated to a single 
card and is unique to it; magnetic coupling to the control leads pre- 
vents propagation of failure effects. 

(12) Because the control is autochronous*® (self-timed), a failure in 
a sequencer causes an asynchronous hang-up, leaving the order reg- 
ister in one-to-one correspondence with the failed sequencer. 

(22) The nature of the ferrite disk? sequencer is such that any fail- 
ure in the active sequencing mode will leave the disk in a partially 
switched condition (in a state representing one element of a continuum 
of states) which prevents the success of any subsequent attempts at 
switching, and results in a hang-up. No control signals are generated 
by attempts at switching subsequent to failure. 


To a “conventional” picoprogrammed control must be added intel- 
ligence to perform remedial action. The remedial tasks illustrated in 
Fig. 1 are: 


(1) Store in an appropriate location the nonorder fields of the in- 
struction (which usually change each time the order is executed). 

(wz) Store in an appropriate location the current address (or the 
current address +1). 
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(12) Derive from the order field the location of its imitating routine. 
(tv) Transfer to the imitating routine. 


The imitating routine alters its instructions according to the non- 
order fields of the defunct instruction, performs the imitation, and 
transfers to the defunct instruction’s successor in the interrupted pro- 
gram. 


VII. PROTOTYPE IMPLEMENTATION 


A small prototype computer was built at Bell Laboratories to test 
the feasibility of self-healing (see Fig. 2). Picoprogramming was 
used, providing the necessary properties for self-healing. 


7.1 Duplication 


The special (limiting) case of duplication is the simplest to im- 
plement, and was the first tried. In effect, dual repertoires are in- 
stalled whose order codes differ by one bit (are adjacent). Any num- 
ber of techniques could be used in a full scale system to guarantee 
that both halves of the extended repertoire are exercised regularly, 
such as assembler insertion or internal bit complementing. The 
equivalent of the second technique was used in the prototype because 
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Fig. 2— Prototype self-healing processor. 
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it does not require an additional bit per order in main memory; the 
internal order code was augmented with a bit which served to choose 
between twin instructions. 

The mechanism used to recognize failure (cessation of activity) was 
a simple integrating circuit, which monitored the completion signal 
common to all instructions, and a level discriminator. 

The remedial action consisted simply of complementing the “twin” 
bit and injecting a bogus completion signal to start the system again. 

Physically simulated failures were experimentally shown to yield 
to self-healing independent of the instruction class, and the imple- 
mentation of this grade of self-healing was definitely proved possible. 


7.2 Nonduplication 


A test off the boundary line was necessary to show that a non- 
duplicating grade of self-healing is possible. The limited size of the 
prototype’s repertoire, however, precluded a complete closed repertoire 
from being tested. Instead the properties of self-healing were studied 
‘ain the small’; repertoires were chosen which could imitate a subset 
of themselves, but which did not include duplicates. 

The procedure followed was to place the machine in a convenient 
program loop performing calculations easily checked for accuracy, 
such as a first order Markov chain. In another memory area was 
stored a routine that could imitate the action of an order used in the 
calculating routine. 

A manual switch was provided to either disable or invoke self- 
healing capability, and another to disable the object order. 

With the machine running in the calculation loop, and with self- 
healing disabled, it was easily shown that disabling (simulating a 
failure) the object order would bring the machine to a halt, leaving 
on display its location, its identity, and its nonorder fields. 

Invoking self-healing capability, and running in the calculation 
routine, disabling the object order had no effect upon the calculations’ 
accuracy in most cases; however, occasionally an error would occur. 
The errors (which will be explained) occurred only at the instant of 
simulated failure; calculations after that (while self-healing) were 
properly performed. When the simulated failure was removed, the 
machine automatically reverted to its original mode of operation. 

An order is always assumed innocent until proven guilty. That is, 
whether or not an order failed the last time it was to be used, an at- 
tempt is made to use it the next time the program calls for it. The 
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innate properties of a failed order thus provide the necessary memory 
that the order is inoperative, and prevent these attempts from causing 
trouble. Should the order spring back to life again, however, either 
because the “failure” was caused by noise or because the failed card 
is replaced, the order “takes over” again, and self-healing ceases. 

This technique does not solve all problems. One difficulty is that a 
fragment of an instruction may be executed before failure occurs 
(this was the cause of the occasionally observed errors at the instant 
of failure), and it may be impossible to reconstruct the status of the 
system before execution of this instruction began. Two solutions are: 


(t) Defensive programming, a technique which is recommended 
for all real time programming applications, but difficult to implement 
as a complete solution. It consists of programming in such a fashion 
that errors (the outward manifestation of a partial instruction execu- 
tion) do not appreciably disturb operation. 

(vw) Picoinstruction counting, which forces the picoinstructions to 
count themselves as they are executed (the counting register is reset 
upon the successful completion of each instruction). Under failure 
the contents of this register are digested by the imitating routine, 
which (the first time) performs only those operations not yet per- 
formed by the defunct instruction. 


VIII. HARDCORE 


8.1 Control Hardcore 

The hardcore (that portion whose failures cannot be healed) 
specifically associated with a self-healing control consists of a fixed 
and a variable component. The fixed component includes the instruc- 
tion register, an order register which copies the order field of the 
instruction register, and their associated circuitry; in addition, a 
relatively simple integrating and level sensing circuit to recognize 
and react to failure, and a flip-flop and timing circuit to store the 
control mode and provide interinstruction timing, are required. 

The variable component depends upon the degree of duplication. 
For full duplication a flip-flop is needed to differentiate between 
twin instructions. Zero duplication requires a means for storing the 
nonorder field of the failed instruction and its address for use by 
the imitating routine, and a means for generating the address of the 
first instruction of the imitating routine and placing it in the instruc- 
tion counter register (assuming a single address format). 
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8.2 Noncontrol Hardcore 


The noncontrol portions of the machine are not directly aided 
by self-healing, but the indirect help is far reaching. For automatic 
diagnosis, the hardcore of the machine does not include the entire 
control, but rather the control’s relatively small hardcore. This means 
that much more of the machine may be automatically diagnosed. For 
extension beyond diagnosis to self-repair (and possibly self-healing) 
in the noncontrol portions of the machine, the more effective diagnosis 
and more trustworthy control are substantial aids. 

Among the more interesting possibilities is that of building an all 
memory machine (including the control) which would use tables in 
a manner reminiscent of the IBM 1620 to replace the arithmetic 
unit and reduce the diagnostic problem to one of handling memory 
(which, because of its relatively homogeneous nature, tends to be 
tractable). 


IX. ANALOGIZING VIEW 


Suppose it were feasible to store a set of remedial routines for each 
order, which could imitate the order’s action for all possible combina- 
tions of order failures. Then as the machine grew old, and one by one 
the orders failed, the control would survive but become increasingly 
slower until a minimum critical subset (possibly one order) remained. 
Under these conditions the control would be acting in a manner anal- 
ogous to the compilation of a high-level language, written in itself. It 
is necessary in the latter case (the compilation) to (software) im- 
plement a critical subset of the language which can bootstrap the 
remainder of the language, just as it is necessary to have a healthy 
(hardware) implementation of a critical subset of the instructions 
in the former case (the aged machine). Extending the analogy further, 
a correspondence may be seen between the computer’s microinstiruc- 
tion language (if it exists) for the former case, and the machine lan- 
guage of the computer used in the latter case. 


X. CONCLUSIONS 


Self-healing is a practicable and potentially useful variation of 
self-repair. 

(Comment: Although picoprogramming was used in this work, it 
was a vehicle of convenience, not an essential constituent. Other im- 


plementations are possible.) 
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APPENDIX 


Picoprogramming 


This is a very brief description of picoprogramming; the serious 
student should see Ref. 3. 

Picoprogramming may be viewed as a logical extension of micro- 
programming wherein the control memory, which in a practical 
microprogrammed machine constitutes some fraction of the control, 
is allowed to expand until it is identical with the control. Figures 
3 through 5 depict the evolution in oversimplified form. 

Figure 3 illustrates the typical pre-third-generation machine con- 
trol, with the order portion of the instruction used to identify a 
sequence implemented, in general, with ill partitioned circuitry de- 
noted by crosshatching. 

Figure 4 shows the microprogrammed control where the order field 
identifies the starting address of a microprogram in the control mem- 
ory, but a conglomerate of circuitry is still required to implement 
the microinstructions, count through them, and the like. 

Figure 5 shows a picoprogrammed control consisting only of a 
memory with the order field now choosing a single memory element. 
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Fig. 3— Conventional control. 


SELF-HEALING CONTROL 2377 


CONTROL 
MEMORY 





Fig. 4— Microprogrammed control. 


The memory element in Fig. 5 is a square loop ferrite disk called 
a myra (for myria-apertured) disk. This disk has the property that, 
when selected, it can spill out many sequential strings of control 
pulses temporally juxtaposed in almost any desired fashion and at 
voltage and driving point impedance levels capable of driving most 
logic circuits directly. Most instructions can thus be implemented 
with a single disk. 
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Fig. 5 — Picoprogrammed control. 
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When these disks are mounted separately with their drivers, they 


exhibit failure autonomy. Further, since no clock is used, the disk 
corresponding to an instruction under execution is in complete and 
independent control, and the machine stops if it fails. In fact, an 
unequipped card can be used to implement a HALT instruction. 


The instruction cards can, in general, be interchanged to provide 


a variable repertoire. 
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Efficient Spacing of Synchronous 
Communication Satellites 


By HARRISON E. ROWE and ARNO A. PENZIAS 
(Manuscript received April 10, 1968) 


Satellites in equatorial circular synchronous orbit remain fixed with 
respect to an observer on the earth, while those in a circular synchronous orbit 
inclined with respect to the equator appear to move in a figure 8. A number of 
satellites can move on a given 8. Optimum packing for a single 8 is de- 
termined; the number of satellites per 8 increases as the 8 becomes larger and 
as the allowable closest approach between satellites decreases. Several 
packing schemes using multiple 8's, regularly spaced, are presented. For 
potential systems using frequencies above 12 GHz, with perhaps 1° closest 
approach between satellites, the best scheme described here permits approxt- 
mately six times as many satellites as an equatorial system. An illustrative 
system serving North America has space for a total of 477 satellites, com- 
pared with 96 satellites if only equatorial orbits are allowed. 


I. INTRODUCTION 


When a satellite is in an orbit whose period is equal to that of the 
earth’s rotation, it is said to be synchronous. A satellite in a synchro- 
nous, circular, equatorial (in the plane of the equator) orbit will 
appear stationary when viewed from the earth. The number of such 
satellites that can be used in a communication system in the same 
frequency band is limited by the directivity of the ground-based 
antennas.t The maximum number would be obtained by spacing them 
equally, at an angular separation just sufficient to keep crosstalk down 
to a tolerable level. 

If the synchronous orbit is inclined to the equator, a satellite will 
no longer appear stationary when viewed from the earth, but will 
appear to move in a figure 8. By placing several satellites in a single 8, 
and spacing different 8’s regularly along the equator, a greater number 
of synchronous communication satellites can be used than with an 
equatorial orbit alone. 
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A price must be paid for increasing the number of synchronous 
satellites by using inclined circular orbits: 


(t) The earth-based antennas must track the satellites. 

(1) The satellite-based antennas must track the earth stations. 

(w2) As the angle of inclination of the orbits is increased (in order 
to increase the number of satellites), the range of latitudes on earth 
within which the satellites will always be visible decreases. 


In this paper we study optimum ways of packing satellites on 
inclined circular synchronous orbits, and determine the maximum 
number of such satellites that can be used under various conditions, 
assuming: 


(1) Strictly circular, synchronous orbits, with consequent uniform 
satellite velocity. 

(it) Each satellite is usable at all times; that is, each satellite 
remains in view of ground stations located as far north as the 
Canadian-United States border, over a strip of substantial east-west 
width. 

(2) If the angular separation between any two satellites exceeds 
a certain minimum value, the crosstalk requirements will be satisfied. 
This minimum value is a parameter in the analysis. 

(wv) The angular separation between satellites is computed as seen 
from the center of the earth. 


These assumptions have the following corresponding consequences 
for this analysis: 


(1) The problem is a kinematic or geometric one, rather than one 
in mechanics as if elliptical orbits, the oblateness of the earth, the 
effects of the moon, and so on, were considered. (Notice that we do not 
consider synchronous elliptical orbits.) This assumption implies that 
the satellites have sufficient on-board fuel to correct their orbits for 
various small perturbations (such as those mentioned above) over the 
expected life of the satellites, in addition to any fuel needed to track 
the earth stations if this is done by mechanically rotating the satellites 
(rather than by electronic beam-steering of the satellite antennas). 

(w) Assumption 7 guarantees that there need be no switching from 
one satellite to another in such a communication system; this estab- 
lishes a convenient bound on the present investigation. Once switching 
from one satellite to another is allowed, the system complexity appears 
to have no natural bound; one could theoretically fill the sky with 
satellites, and find a convenient one somewhere. 
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(72) The minimum separation assumption avoids a detailed analysis 
of crosstalk. Such an analysis would have to include, for example, the 
type of modulation (AM, FM, PCM, and so on), the (angular) 
distances to all the satellites in the neighborhood of the one under 
study, the location of the ground stations, ground- and satellite- based 
antenna patterns, and the like. 

(iv) By computing separation relative to the center of the earth, we 
neglect the small corrections which depend on the location of the 
ground stations; these corrections in satellite separation can vary 
between zero and as much as +18 percent in the worst case, as shown 
in Appendix C (see also Fig. 1 or 2, discussed below). However, 
assumption 2v leads to simple, exact expressions for almost all of the 
interesting results. 


Figures 1 and 2 show to scale the earth and a synchronous satellite, 
at declinations of 25° and 30° south latitude. The latitudes of repre- 
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Fig. 1 — Synchronous satellite on an inclined orbit; southernmost declina- 
tion = inclination = 25°. 
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Fig. 2—Synchronous satellite on an inclined orbit; southernmost declina- 
tion = inclination = 30°. 


sentative points in the United States and Canada are shown. Much 
of the United States is south of 45° north latitude; a ground station 
at this latitude will observe a synchronous satellite at the same longi- 
tude (as the ground station) at an elevation of 11.6° (Fig. 1) or 6.4? 
(Fig. 2) above the horizon. The smaller the satellite elevation the 
greater the path length lying within the atmosphere, with attendant 
greater rain attenuation, wavefront distortion, and other atmospheric 
effects. 

Section II indicates that the extreme value of latitude (north and 
south) attained by a synchronous satellite is equal to its orbit’s angle 
of inclination from the equator. Figures 1 and 2 suggest that the 
maximum angle of inclination of interest for the United States is about 
30°. Such large angles will probably rule out major parts of Canada. 
However, in the systems described below, 8’s containing several satel- 
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lites each will be regularly spaced along the equator, separated by a 
smaller number of satellites in equatorial orbit equally spaced between 
the 8’s. These equatorial satellites could probably be used to serve the 
lighter traffic to the more northern regions of North America or the 
more southern regions of South America, while both the figure 8 
satellites and the equatorial satellites serve the heavy traffic within 
the continental United States. 


Il. GEOMETRY OF INCLINED, SYNCHRONOUS ORBITS 


Let us compute the angular separation between satellites as seen 
from the center of the earth (assumption iv, Section I). This permits 
the calculation of all desired quantities by the straightforward use 
of the results of spherical trigonometry. By convention in spherical 
trigonometry, great circle distances on a sphere are measured by the 
angle they subtend at the center of the sphere; we use this convention 
throughout this article without further comment. By convention, with 
one principal exception, all lines drawn on the surface of a sphere in 
any figure of this article are great circles; the exception is the 8 
described in the Introduction. Thus, longitude lines are allowed, while 
latitude lines are not. 

Figure 3 shows the earth with north and south poles, the equator, 
and the projection on the earth of a (circular) synchronous orbit, 
inclined at an angle a to the equatorial plane, passing through the 
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Fig. 3— Reference point for satellite. c = 0°. 
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equator at its midpoint as seen by the observer. Since the orbit is 
viewed edge-on, it appears as a straight line rather than as an ellipse. 
A satellite is shown on the orbit line as a heavy dot. At the instant 
shown in Fig. 3, the satellite is at the intersection of the orbit with 
the equator; this configuration provides a convenient reference from 
which to measure time (or, equivalently, rotation). Finally, a longitude 
line is shown as a convenient reference on the earth, passing through 
the point on the equator which the satellite crosses; this also appears 
as a straight line in Fig. 3, since the plane of this longitude line con- 
tains the observer at this particular instant. 

Figure 4 shows the satellite and the earth after some time has 
evolved: after the earth and the satellite have travelled an (angular) 
distance c. Of course, c will vary linearly with time in all of the 
following, increasing by 27 radians or 360° in one day. In Fig. 4, 
c = 7/4 radians = 45°. Since both the satellite and the point on the 
equator that coincided with it when the satellite crossed the equator 
have travelled a distance c < 90°, the satellite will lie to the left to 
the reference longitude line, on the dashed longitude line shown in 
Fig. 4. When c = 90° the satellite will have caught up with the longi- 
tude line and have attained its maximum north latitude (at the right- 
hand edge of the earth in Fig. 4). As c increases further the satellite 
will get ahead of the longitude line and start south; at c = 180° the 
satellite crosses the equator and the longitude line simultaneously, the 
situation being as shown on Figure 3 except that satellite and longitude 
line lie on the far side of the earth. For 180° < ¢ < 360° the above 
sequence is simply repeated symmetrically below the equator. 

It is clear from the above description that such a satellite will move 
in a figure 8 as seen from the earth. Figures 5a and b illustrate this 
8 pattern superimposed on Figs. 3 and 4. 

The geometric parameters that describe the 8 are defined in Fig. 4, 
where we recall that all (great circle) distances, namely, c, 1, a, e, and 
y, are measured by the angle they subtend at the center of the sphere. 
L is the latitude of the satellite, e the longitude of the satellite mea- 
sured with respect to a fixed observer located at the point on the 
orbit where it crosses the equator, wy is the relative longitude of the 
satellite measured with respect to an observer on the earth at the 
equator and the reference longitude line, a is the great circle distance 
from this earth observer to the satellite, and e the relative angle 
(azimuth) between the earth observer’s north and the satellite; a and 
e correspond to local polar coordinates fixed to the surface of the earth. 
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Fig. 4— Satellite at some later time than Fig. 3. c = 45°. 


In Fig. 4 and subsequent figures we adopt the convention that dimen- 
sions marked with two arrows are regarded as positive quantities; 
those with only a single arrow are positive in the direction of the 
arrow, and negative in the opposite direction. 

Figure 6 shows an enlarged drawing of the 8 of Fig. 5b viewed from 
directly overhead, indicating those parameters of Fig. 4 that are 
measured relative to the earth observer. In addition, x measures the 
great circle distance from the satellite to the reference longitude, and 


Dy 


Fig. 5 — Figure 8 pattern, (a) c = 0° as in Fig. 3; (b) ¢ = 45° as in Fig. 4, 
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Fig. 6 — Enlarged 8 with relative geometric parameters. c = 45°. 


t the (great circle) distance from the satellite to a point on the equator 
a distance ¢ from the “cross” of the 8.* 
The following results hold true for the parameters of Fig. 4 and 6: 


. a a TOS 
sin 5 = sing 'sinc. (1) 
- 
sin 5° COs ¢ 
sine = ~———--___- (2) 
~2@. ‘ 
(1 — sin” 5 "sin’ c) 


* The instant illustrated in Fig. 6 (c — 45°) is rather special, in that x has its 
maximum value and the dashed longitude line passing through the satellite is 
almost tangent to the 8. These conditions obviously do not hold in general. 
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. a = 
gin? 9 ‘sin 2¢ 








Se i (1 — sin? e-sin’ c)} (4) 
Ste cosa@:sinc . 
sine = (1 — sin? asin’ 0)! (5) 
ety=c. (6) 
sin x = sin’ = sin 2c. (7) 
cos t = cos’ £-cos ¢ + sin? =- cos (2c + &). (8) 


2 2 


Next, we consider the (great circle) separation of two satellites 
travelling on the same 8. While these satellites lie on the same 8, they 
lie on different orbits. The geometry of this situation is illustrated in 
Fig. 7. One orbit is viewed edge-on, as in Figs. 4 and 5; the other orbit’s 

plane is inclined, and so it appears as an ellipse in this figure. The 
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Fig. 7 — Two satellites on the same 8. c: = 45° cs = 90°. 
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distance between the two satellites is denoted by v. The two satellites 
have phases c; and Cs, with phase difference co; c; and Cz of course 
have the same linear variation with time, while co is fixed. Then the 
distance between the two satellites is given by 


a 


: : re C 
5 sin’ c, + sin’ a-sin’ 3 cos” (, fs ) ; 


~ 2 ae 
sin 5 = sin 
2 2 


Co = Cg ~ C1. (9) 
Equation 9 yields equation 1 by setting v = a, cy = c, Co = 0. 
Consider next Fig. 8, which shows two identical 8’s spaced by a 
distance £ along the equator. The parameter u denotes the (great 
circle) distance between symmetrically located points at the same 
latitude. wu is illustrated at two representative times, indicated as wy, 
and us. The corresponding ranges for cz are shown on Figure 8, where 
Cz is the phase parameter on the right 8. Then 


sin 5 = cos” sin § — sin’ 5 sin (26, = d). (10) 

Our final result contains some of the preceding results as special 
cases. Consider two 8’s of differing size, characterized by parameters 
a, and ag, spaced by a distance ¢ along the equator. This situation is 
illustrated in Fig. 9. While for clarity the two 8’s in Fig. 9 have been 
drawn as not overlapping, this is not a necessary restriction in the 
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Fig. 8 — Distance between identical 8’s at the same latitude. cp = phase param- 
eter on right-hand 8. 1:0 < ce < 90°; w2:90° < co < 180°. 
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Fig. 9— Distance between two satellites with arbitrary phases, located on dif- 
ferett 8’s of arbitrary sizes. 


following. Consider a satellite on each 8, having phases c, and cg. Then 
the great-circle separation s between these two satellites is given by 


a 
cos s = cos £: cos” a cos” 2 


Sin a;-S1D a> 


+ ones [cos (co — ¢;) — cos (C2 + ¢,)] 


+ sin’ 5 COs” : cos (2c, + ¢) 


+ sin’ = cos’ + cos (2c. — ) 
+ sin? 3 sin’ mn (2c, — 2c, — §). (11) 


The following specializations of equation 11 yield the indicated 
above results: 


Equation - Pues m Equation 11 


1 Set =e =ea= oe f=. 
“8 -s=tia=a =6¢=O0ora = 0. 
9 S=Via, =o =a,f = 0. 

10 S=UuUla =o =a, = 47 — 
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One final specialization of this result is of interest, in which the two 
8’s have the same size. For convenience we denote the phase difference 
for the two satellites (each on its separate 8) by co, as in equation 9. 
Thus setting 

QQ, = =a, (12) 
C=C +o, 
in equation 11, yields 


eo een ee ee 5 int Ssin ( Hf 
sin 5 = | cos 9°55 + sin 9 sin Co 9 


+ sin a-sin (% 5 ©) -e0s (c “+ ) ; (13) 


This relation also reduces to equation 9 for £ = 0, in an obvious way. 

All of the above results are exact; no small-angle or any other ap- 
proximations have been made. Their derivation is sketched in Ap- 
pendix A. They are necessary for calculating minimum satellite sepa- 
ration in various packing schemes, discussed in Sections III through V. 





III. OPTIMUM PACKING ON A SINGLE 8 


Equation 9 shows that the separation between two satellites on 
the same 8 varies periodically with time in a simple way. This rela- 
tion, together with Fig. 7, readily permits the exact calculation of 
the closest approach or minimum separation Umin and the correspond- 
ing satellite phases C; min 2Nd Ce min at Which it occurs. 


Cimin = & 


wl 


T Co 
) Comin = +5 +5) 


Nl 


(14) 
Umin 


2 





sin = sin’ 5-| sinc | , Co = Co — C1, 
where all c’s are measured in radians. 

Figure 10 illustrates the satellite positions at minimum separation 
for two representative cases. Minimum separation for any two satel- 
lites occurs when the two lie at the same latitude. (A similar study 
for maximum separation is easily performed; maximum separation is 
illustrated in Fig. 11 for the same satellite distributions shown in 
Fig. 10.) 

Consider N satellites placed on an 8, with phases c,, co, -*- cy, all 
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Fig. 10 — Satellite positions at minimum separation. Identical (minimum) sep- 
aration in both figures. (a) co = 20°, (b) co = 160° 


having the same linear variation with time ¢ (but different values at 
t = 0); each c increases by 27 radians in one day. We represent the 
phases of these satellites by points on a circle at angles c,, Cz, °** Cy. 
This pattern of points will rotate uniformly counterclockwise at one 
revolution per day, and the relative angular positions of these repre- 
sentative points will remain constant with time. One such point is 
shown in Fig. 12 at angle c,;. Suppose the desired minimum separation 
between any two satellites on this 8 is specified as vyin. The separation 
between the satellite with phase c; in Fig. 12 and any other satellite 
will equal or exceed Umin if (equation 14) 











sin “= 
| sinc, | = = si Comin » (15) 
sin” 3 
i“ 
\ 
(a) (b) 


Fig. 11— Satellite positions at maximum separation. (a) co = 20°, (b) co = 160°. 
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Fig. 12— Phase of a satellite with associated sector. comin given by equation 15. 


where Cp is now regarded as the phase separation between the satellite 
under study (No. 1) and any other satellite on this 8. 


Suppose we associate with the point c, on Fig. 12 a shaded sector of 
angle Comin centered on.c, , and the image of this sector reflected through 
the axis. Next imagine a second representative point placed on the circle 
at angle c. , with a similar pair of shaded sectors. If the shaded regions 
corresponding to the two representative points do not overlap, equation 
15 will be satisfied and v = vn;, . Additional points cs , c, , +++ may be 
similarly added such that none of the shaded areas overlap, and the 
separation v between any pair of satellites at any time will be guaranteed 
to exceed Umin - 

It is clear that efficient packing requires adjacent shaded areas to 
just touch; optimum packing will be attained if the entire area of the 
circle is filled with shaded sections, that is, if 


N =- =, (16) 
0 min 


where JN is the total number of satellites (co min is in radians). The 
minimum separation with optimum packing for WN satellites on the 8 
is therefore given by 





CA ae A cae ep 
sin“ = sin’ 5°sin (17) 


Equation 17 is illustrated in Fig. 18, where the pertinent angles, Umin 
and a, have been expressed in degrees rather than in radians. 
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It remains only to illustrate the possible geometric distributions of 
satellites corresponding to the optimum packing of equation 17. Figure 
14 illustrates the geometry for an odd number of satellites, N = 3. 
The time has been arbitrarily selected so that one satellite is located 
at the “cross” of the 8, at zero phase. In Fig. 14a the satellites have 
been uniformly distributed over half the circle; this may be regarded 
as the canonical optimum distribution, with others derived from it. 
The discussion in connection with Fig. 12 makes it clear that an 
equivalent optimum distribution results from shifting the phase of 
any of the satellites by 180°. 

Figure 14b shows a possible alternative. In general, if the even 
numbered satellites for N odd are shifted by 180°, then the satellites 
will be equally-spaced in phase over the entire circle, as illustrated 
in Fig. 140. This may be regarded as a second canonical distribution, 
of subsequent interest for an odd total number of satellites on the 8. 
Other distributions are clearly possible, although not for N = 8. Figure 
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Fig. 13 — Closest approach of satellites on a single 8 with optimum packing. See 
equation 17. VN = number of satellites on a single 8, 
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Fig. 14— Optimum packing for N = 3. Uniform phase spacing over (a) 180° 
and (b) 360°. 





15 shows the satellites at different times for the two situations in Fig. 
14; the instants of closest approach indicated in Fig. 15 have identical 
minimum separation. 

In general, the optimum uniform distribution over 180° is given by 


ae eee 0 or p=1,2,---N; 


uniform phase spacing over 180° (18) 


For an odd number of satellites, we have alternatively 
c, = ¢, + Ap — Dy p=1,2,---N, WN odd; 


uniform phase spacing over 360°. (19) 


(The c, are measured in radians in equations 18 and 19.) 

All equivalent optimum packing schemes, for a given N and a, result 
in the identical vyin, that is, minimum separation or closest approach 
between any pair of satellites at any time. However, other geometric 
properties (such as the average spacing) may vary widely for the 
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various packing schemes. The minimum separation is the only param- 
eter considered here by assumption wz of Section I. The systems im- 
plications of different packing schemes are discussed briefly in Section 
VIL. 

The results in this section, giving optimum packing of satellites 
on a single 8, are exact (subject to the assumptions of Section I). 
They are used in Sections IV and V to determine the best way to 
stack 8’s under various conditions and so determine the maximum 
number of synchronous satellites that can be used in an extended 
portion of the sky, subject to different constraints. 
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Fig. 15 —Satellite motion for N = 8, optimum packing. Phase increment = 


30°. Uniform phase spacing over (a) 180° and (b) 360°. 
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IV. OPTIMUM PACKING—UNSYNCHRONIZED, SEPARATED 8’S 
Assume that we are given: 


(t) The minimum allowable separation (that is, the closest ap- 
proach) between any two satellites = vmin. 

(21) The maximum allowable orbit inclination (equal to the maxi- 
mum declination of the satellites) = amax- 

Optimum packing on a single 8 has been studied in Section III. 
We seek to increase the number of synchronous satellites (over the 
number available in an equatorial system, that is, using a synchronous 
equatorial orbit only) by using such 8’s spaced equally along the 
equator. 


The canonical distribution for N satellites on an 8 is given by equation 
18 for general N, or for odd N by equation 19 (with other distributions 
possible as mentioned in Section JIT). For a given N and a given Umin ; 
it is obviously desirable to use the minimum a allowed, given by equation 
17 or Fig. 18, to minimize the space occupied by these N satellites. This 
equation and figure yield the maximum number of satellites Nin that 
can be placed on a single 8 for a specified Umin » Qmax - We therefore con- 
sider the improvement possible for N = 2, 3, -+- Nimax , With corre- 
sponding a@ = ay , Q@2, °** ONnmay XS Omax given by equation 17 or Fig. 13. 

It might have been anticipated for specified Umin » @max that the largest 
number of satellites per 8 (that is, Nma<) Will yield the greatest improve- 
ment. This is indeed true for the smaller values of Nx (perhaps Ninax 
< ~10). Although cases exist for Ninax > 18 where Nmax — Lor Nina — 2 
are slightly better, the difference is not significant. Consequently, it 
seems likely that the configuration corresponding to Ninaxy @vmax Will 
either be optimum or very close to it. 

Assume that 8’s each with N optimally packed satellites are spaced 
equally along the equator. The N satellites on each 8 must of course 
be carefully synchronized; however, we assume in this section that no 
synchronization will be required between the relative phase of groups 
of satellites on different 8’s. Two such 8’s are shown in Fig. 16. The 
minimum (great circle) distance between these two 8’s, Umm, must 
equal or exceed Umin in order that the closest approach between satel- 
lites on different 8’s equals or exceeds Umin. The distance between 8’s 
along the equator is denoted by £; its minimum value, fmin, corre- 
sponds to Unin = Vmin, that is, the minimum distance between 8’s 
equal to Umin. Two points are shown on the equator, whose minimum 
distance to their respective 8’s, tmin, is set equal to Umm; these points 
are a distance @ from each 8 measured along the equator. There is a 
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Fig. 16 — Geometry of adjacent 8’s. & = min for min = Umin. Umin == closest 
approach among N optimally packed satellites on each 8. 


portion of equator £ — 2¢’ long, available for additional synchronous 
equatorial satellites, separated from each other by a distance Umin.* 
We now define an improvement factor J that measures the effective- 


*The possibility of using smaller 8’s, rather than only equatorial satellites 
between the major 8’s, has also been considered. However, a crude analvsis that 
replaces sincs of small angles by their arguments and effectively neglects the 
curvature of the sphere shows that for N < 16 there is not enough room for a 
small 8 containing even two satellites between the two main 8’s spaced at the 
minimum distance {min. This suggests that small 8’s will be of no value for the 
range of intercst in this paper (V < ~ 25—sce Fig. 13), although the packing 
can obviously be improved for very large N (N > 25) by this means. A precise 
investigation of this problem requires a detailed study based on equation 11. 
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ness of our satellite packing schemes as follows: 


number of satellites per unit of Jongitude for 


packing scheme under consideration (20) 
? 


[T= 
number of satellites per unit of longitude 


using equatorial orbit only (that is, without 8’s) 


with closest approach the same in numerator and denominator. If 
E denotes the number of equatorial satellites between adjacent 8’s, 
then from Fig. 16 


/ 
k= p=} ee | (21) 
where the { } denote “the largest integer contained in” the enclosed 
expression.* For the packing scheme under consideration N + E satel- 
lites occupy a longitude interval ¢. For an equatorial system one satel- 
lite occupies a longitude interval vm, so that effectively £/Umin satel- 
lites occupy a longitude interval ¢. Therefore from equation 20 and 
21 the improvement factor is 


v1 4 a2 


‘g ? 
Umin 
(Notice that the denominator of equation 22 need not be an integer, 
as must the numerator.) 

I of equation 22 is a function of ¢. Suppose we first set € = fmin, 
that is, place adjacent 8’s as close as possible without causing (asyn- 
chronously phased) satellites on the two 8’s.to approach each other 
closer than Umin. In general the E equatorial satellites of equation 21 
will not exactly fill up the available space on the equator of length 
fmin —2’; that is, the quantity (min —2’) /Umin Will not be exactly an 
integer. oa 

We now inquire if increasing the spacing ¢ between adjacent 8’s 
beyond its minimum value can improve I of equation 22. The answer 
is sometimes yes, sometimes no. Intuitively, if (min —2€’) /Umin in the 
numerator of equation 22 is only slightly greater than an integer, that 
is, if there is only a little extra room on the equator at minimum 
distance between 8’s, then € = fmm gives the largest improvement 


I ¢ = (mite . (22) 





* Equation 21 is obviously valid only when it predicts EH = 0; it will appear 
below that F = 1. 
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factor J. However, if (fmin —2¢’)/Umin is only slightly less than an 
integer, in other words, if there is almost enough room on the equator 
for an extra satellite for € = min, 1b pays to increase € a little beyond 
fmin tO just accommodate the extra equatorial satellite. It clearly 
never pays to increase € further. 

Consequently, we evaluate J and E of equation 22 and 21 for the 
following two values of ¢: 


ie = Corin (23) 


I 


2¢7 Is Umin + tau = 2), (24) 


Vin 


$2 


The corresponding improvement factors J; and Iz, with corresponding 
numbers of equatorial satellites H; and He, are: 





N+1+ {fe — 2° 2} bon — BE? 
q, = min ; E, = {fain = 25° + 1. (25) 
Smin Umin 
Umin 
N +2 + (ian 20} min — 2¢ 
Gita z 2¢' 267 Vrain 
\ Umin } - Vain 3 : 


The optimum improvement factor is the greater of I; and Iz. It re- 
mains only to evaluate equation 25 and 26 from the results of Sections 
II and IIL. 

From equation 17 





- Umin = Ae 2a. : ee 
sin “3° = sin’ 5'sin 55 (27) 


From equation 10 and Fig. 8 


Umin 








. ae 2 Q. . c rene: a . 
sin“ = cos’ "sing — sin’ 5 } (28) 
consequently, 
P sin “2 
sin2 = + tan? =- (29) 
cos? = 4 
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We set Umin > Umin and €—> fmin In equation 29 to obtain 





2 & De 
= tan 5 (1 + sin 2) (80) 


From equation 8 and Fig. 6, with ¢ > @ to conform to the notation 
in this section and Fig. 16, 
COS tmin == COS” 57 C08 ¢’ + sin’ 5° (81) 
Using the double-angle formula for the two cosines, this becomes 
i (se 
2. = cos 5-sin > j (32) 
finally setting tnin > Umin in equation 32, 





sin 


v 
yee = 


2 


sin? 5 sin a 


cos = 
2 


Equations 27, 30, and 33 substituted in equations 25 and 26 permit 
evaluation of I (the improvement factor) and EH (the corresponding 
number of satellites on the equator between 8’s), as a function of NV 
(the number of satellites per 8) and @ (the orbit inclination or 
maximum dcclination of the satellites on the 8’s), for the two interest- 
ing values of spacing (along the equator) between 8’s given in equation 
23 and 24. These results are plotted on Figure 17; equation 25 is 
shown by solid lines, and equation 26 by dashed lines. 


Figures 13 and 17 used in conjunction determine the improvement 
factor for the present packing scheme. (Some values of N have been 
omitted in these figures for clarity, but may of course be supplied by the 
equations from -which these figures were obtained.) Given Unin ANd Omax : 


(7) On Fig. 18, find the largest value of N corresponding to vmin and 
@ <= dmx - Denote this value of N by Nix , the associated value of 
a DY Qymax } Nar S Omax + 

(iz) On Fig. 17, find the two possible improvement factors (solid and 
dashed) corresponding to Nmax » vm, 2nd choose the larger. Read the 
associated number of equatorial satellites from the appropriate curve.* 

* As mentioned at the beginning of this section, it-is possible that a smaller 
value of N < Nmax may yield a slightly, although not significantly, larger im- 


provement factor, Fig. 17 makes it clear that this is possible only for large N, 
and of course for intermediate N not plotted in this figure. 
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A variety of possible types of behavior is illustrated in Fig. 17. 
For some N the solid curve always exceeds the dotted curve (for 
example, N = 2, 8, 7, 10) and the 8’s should be spaced as closely as 
possible; for other N the reverse is true (as when N = 5) and the 
distance between 8’s should be increased just enough beyond its mini- 
mum value to permit one additional satellite on the equator between 
each pair of 8’s. In both these cases the number of equatorial satel- 
lites remains constant for all values of a shown. 

There are still other cases where both types of the above behavior 
hold for a given N; for small values of a the 8’s should be close- 


I- IMPROVEMENT FACTOR (COMPARED TO EQUATORIAL SYSTEM) 





@-ORBIT INCLINATION OR MAXIMUM DECLINATION IN DEGREES 


Fig. 17— Improvement factor for optimum satellite packing on unsynchronized, 
separated 8’s. Use this figure in conjunction with Fig. 13. £ = number of satellites on 


equator between two adjacent 8’s. N = number of satellites on a single 8. 
indicates minimum spacing between adjacent 8’s; closest approach for adjacent 
8’s = closest approach for a single 8. (See fig. 13 and equation 25). --—~— indicates 


increased spacing between adjacent 8’s that just permits 1 additional equatorial 
satellite. See equation 26. 
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spaced, for large a there should be an extra equatorial satellite (such 
as, N = 4, 20). Here the number of equatorial satellites increases by 
1 at a critical value of a. 

Finally, for N = 15 the behavior changes at a critical value of a 
in a different way. Here the number of equatorial satellites remains 
constant for all «2 shown. For small a the spacing between 8’s should 
be increased to allow for an extra equatorial satellite, while for large 
a the 8’s are close-spaced. 

The above relations simplify as « > 0 or as N > o. We have: 


























a= 0 
eee 
a Pex) z= |L L (34) 
eer: }sin 
N 
veal 
sin | 
i= t N’; Bes aE) cio (35) 
= te zt fe n) 
N= 0 
eee aa (36 
sin” * tan” 5 7 sin” 5 


It is clear from equation 36 that an upper bound on the improvement 
factor for the present packing scheme is J = 7 + 1, with H/N = 1/z. 
Figure 13 shows that this may be approached only for extremely small 
closest approach UVpin; hence practical improvement factors will be 
smaller. 


V. OPTIMUM PACKING—SYNCHRONIZED, OVERLAPPING 8’s 


In the packing schemes of Section IV, optimally-packed 8’s are 
spaced far enough apart so that closest approach for satellites on 
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adjacent 8’s equals or exceeds closest approach for satellites on the 
same 8, for arbitrary relative phasing on different 8’s. We now inquire 
if the number of satellites can be increased (for a given closest ap- 
proach and orbit inclination or maximum declination) by careful 
relative phasing on 8’s in such a way that they can lie closer together, 
and even overlap (with proper interleaving of satellites on different 
8’s). We restrict the present treatment to 8’s of identical size (that is, 
identical orbit inclination «) for simplicity, with equatorial satellites 
between 8’s as permitted. 

The basic relation needed for this study is equation 18 (and equa- 
tion 12), which shows that the separation between two satellites on 
different 8’s (of the same size) varies periodically with time in a 
simple way. We denote this minimum separation by Smin, occurring 
at satellite phases C1 min and Co min on the left and right 8’s (spaced by 
€ along the equator), respectively. Then 


Cimin = ao — - ) Comin = a i <a ’ 
1 Sain = gin? 2] gi ‘) aS 
sin “5 sin’ 5°] sin (:, 9 + cot’ 5 sin a0 (37) 





Co = Co — C1. 


We see that closest approach for satellites on two different 8’s of the 
same size occurs when the two satellites attain the same latitude; 
this behavior was observed in Section III for two satellites on the 
same 8 (compare the top lines of equations 14 and 37). smin of equation 
37 reduces tO Umin of equation 14 for ¢ = 0, the degenerate case in 
which the two 8’s become identical. 

Assume now that smin is given. Equation 37 then determines two 
forbidden regions for co; for every satellite on the left (No. 1) 8, 
with phase c,, two forbidden regions in phase space on the right (No. 
2) 8 are established, that is, two ranges of cy within which no satellites 
may be placed. Thus, in addition to the self-phase-space of Section 
III (determined by equation 14), which governs packing on each 8 
considered independently, a mutual-phase-space (determined by equa- 
tion 37) must be considered for each pair of 8’s lying closer than 
fmin Of Section IV (equation 30). 

Figure 18 and 19 give a convenient geometric picture of this pair 
of forbidden regions. We assume each 8 of the pair considered to 
contain N optimally packed satellites, according to Section III. Then 
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Fig. 18 — Forbidden region for relative phase between satellites on different 8’s 
(equation 37). N = 9. (a) General cane: cot? a/2-sin ¢/2 = 0.5 (separation about 
half that for two 8’s just touching), K = 0.6. Enlarged portion of upper curve, 
containing forbidden region to left of origin, shown in Fig. 19. (b) Limiting case 
(single 8). ¢ = 0, K = 1. This curve is the same as the lower curve of (a). 


closest approach on each 8 is Umin, given by equation 17 as 


sry al en tn ei 
sin “9 sin’ 5"sin 7 (38) 
We assume that snin, closest approach between 8’s, is less than Unin, 


according to the following relation for later convenience. 





sin 3" = K-sin “3, C2 1, (39) 
Thus, 
: Smin = 
sin “5 = sin’ 5K sin 5 0<K <1. (40) 
For zero spacing, § = 0 (Fig. 18b), the principal forbidden regions are 
located symmetrically about c. = —z, 0. In this case for K = 1 their 


width is d = 27/N. As the distance between 8’s increases (Fig. 18a) the 
centers of the forbidden regions approach each other symmetrically and 
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their width d increases, slowly at first. This pattern of forbidden regions 
repeats periodically with period 27. Figure 19 shows an enlarged portion 
of the principal forbidden region closest to the origin. The boundaries of 
the forbidden regions are defined in Fig. 18 and 19, the center of one of 
them in Fig. 19, at —D. Dimensions with a single arrow have a sign 
(for example, d, , d- , D, D), those with two arrows are positive (d, d/2), 
as before. 

From Figs. 18 and 19 and equations 37 through 39 the parameters 


0.5 







g f 


SIN(Co- 5) + cores sin ea 








Fig. 19—-Enlarged portion of upper curve of Fig. 18a, showing forbidden region 
closest to origin. 


Forbidden Region 


Exact Approximate 

D = 30.92° D = 30° 
d = 27.55° d = 27.2° 
d_=1715° D-$ = 164° 
° d ° 

d, = 44,7 D+ a 43.6 
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defining the forbidden regions are: 


Bast eee. £ aa 

d, = sin (cot 9 °Sin 5 + K sin 2) (41) 
p=St& (42) 
d=d,-—d_. (43) 


The centers of the principal forbidden regions are located at —D, 
— (ma — D).In phase space they are located as follows: 


ae ee eee 
7 (44) 
c _d Cis a 
9 ar+D 9 <% <5 r+ Dts 


The parameter cot” a/2-sin ¢/2, which appears throughout the above 
relations, is a normalized spacing parameter. For such a ¢ that the two 
8’s just touch, cot” a/2-sin ¢/2 = 1; therefore, for overlapping 8’s this 
parameter lies between 0 and 1, and for some of the following work 
where overlapping 8’s lie very close together cot” a/2-sin ¢/2 « 1. 
Since a < 30°, ¢/2 < 4.12° for overlapping 8’s; consequently the 
approximation sin ¢/2 — ¢/2 is valid for most purposes. 

In the large N case certain approximations are useful. D and d of 
equations 42 and 43 are approximated by D and d, given by equations 
45 and 46 and illustrated in Fig. 19. 


D = sin’ (cot 5'sin {) oD: N> 1. (45) 
2K sin 7 
Cee ay N> 1. (46) 


While these results are not sufficiently precise for all purposes, they 
serve at least as a rough guide for initial thinking about the problem. 
For the case illustrated in Fig. 19 (V = 9, K = 06, cot? a/2-sin £/2 
= 0.5) the approximate results of equations 45 and 46 yield a first 
forbidden region 0.35° too narrow (out of a total width of 27.55°) 
and 0.92° too far toward the origin. This case has large spacing, and 
the error will decrease for smaller ¢. In the approximation of equations 
45 and 46 the width of the forbidden regions varies linearly with the 
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secant of the displacement of their centers and with the separation 
reduction factor K, and the location of their centers is independent 
of K. 

A phase-plane picture of the forbidden regions of Figs. 18 and 19 
is illustrated in Fig. 20; for later convenience different shading and 
edges are used to denote the two forbidden regions. For zero spacing, 
€ = 0, the forbidden regions have minimum width d and are located 
symmetrically about 0 and z in the (cy — £/2) plane; as ¢ increases 
the forbidden regions approach each other and their width d increases, 
slowly at first. In Fig. 20a, K = 1 and d > 2x/N for ¢ > 0; in Fig. 
20b K = 0.6, and for the range of € shown d < 27/N. It is clear that 
given an upper bound on ¢, K can be chosen small enough so that 
d < 2x/N. 





g 


SINS = 


2a 
0.5 TAN? 5 


(b) K=0.6 


Fig. 20 — Forbidden regions of Figs. 18 and 19 in the (co — ¢/2) phase plane. 
N = 9. Figures are symmetrical around the vertical axis (¢ — ¢/2 = + 7/2). 
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We now make the additional assumption that each optimally- 
packed 8 possesses uniform phase spacing over 360°, as in equation 
19; this of course implies that N is odd. We inquire whether two or 
more such 8’s can overlap in a useful way, that is, so that improve- 
ment factors greater than those of Fig. 17 can be obtained (for the 
same minimum separation or closest approach). This assumption 
simplifies the following discussion by rendering all phase-space dia- 
grams N-fold rotationally symmetric, and does yield significant in- 
creases in the improvement factor. While superficial study of 180° 
packing on the individual 8’s seems to indicate lower improvement 
factors in certain simple cases, we have not made a careful study of 
this alternative case to determine what, if any, advantages it might 
have. Furthermore, we have not considered other possible (non-180° 
or non-360° single-8) packing schemes of Section III at all. 

Consider the case of two interleaved 8’s. The satellite phases on 
these 8’s are (equation 19): 


Gin =A +29—-N)e, p=1,2,-+-N; 
N odd. (47) 
Cio = Cay + 29-1) FZ, gq=1,2,---N; 
We define 
Co = Cap —~ Cy, p=l,2,°°- N. (48) 


All of the c;’s and c.’s vary linearly with time, increasing by 27 in 
one day. Figure 21a shows the corresponding phase-space plots for 
each 8, as determined in Section III. In this figure we assume without 
subsequent restriction that the phase of the p = 1 satellite on the Ieft 
8 is 0, C1¢1) = 0. We now inquire what is the optimum value for cy (the 
relative phase between corresponding satellites on the two 8’s) to 
maximize Spin, (the closest approach between satellites on different 
8’s), and thus maximize the packing improvement factor I. 

We investigate this question by a mutual phase-space picture 
derived from Fig. 20. Associated with each of the N satellites on the 
left 8 (No. 1) there are two forbidden regions in phase space co (or 
equivalently cg — ¢/2 where this is more convenient) determined by a 
figure such as one of those in Fig. 20. The details depend on the two 
parameters K (the separation reduction factor of equation 39) and ¢ 
(the equatorial spacing between the two 8’s). Figure 20 gives di- 
rectly the forbidden regions in cg — £/2 phase space corresponding to 
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N-1 
2 
N+1 
2 
ee N-t 
LEFT 8 (NO.1) Cy) =0 RIGHT 8 (NO.2) 
Cy CORRECT FOR {/Vyin=0.5 IN (b) 





Fig. 21—Self and mutual phase-space plots for 360° packing on each 8. N is 
odd, N = 9, a = 30°, K = 0.75. (a) Self phase-space plots for two overlapping 
8’s. Phase of satellite 1 = 0 on left 8. (b) Mutual phase-space plot: forbidden 
regions in (c2 — ¢/2) plane determined by satellites in left 8 with phases c: of left 
figure in (a). 


the No. 1 satellite on the left 8 in Fig. 21a; the corresponding forbid- 
den regions for the other (N — 1) satellites are determined by rotat- 
ing such a figure (N — 1) times by an angle 2z7/N. 

The resulting mutual phase-space picture, shown in Fig. 21b, gives 
the forbidden and permitted regions for satellites on the right (No. 
2)8.* The values of parameters have been chosen in this figure for 
convenience of illustration, and not necessarily for optimum packing. 


*The phase-space plots of Fig. 21 correspond to the particular time at which 
Cia) = 0; they rotate uniformly counterclockwise, at one revolution per day. 
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We may imagine that these figures are generated by two spoked wheels 
rotating in opposite directions on a common shaft. The clockwise 
rotating wheel has spokes indicated by shading lines, the counter- 
clockwise wheel by dots, corresponding to the two forbidden regions 
indicated in the drawings of Fig. 20. Clearly K < 1 in order that there 
be space between spokes for any ¢. As each wheel rotates the spokes 
widen, and the space between the spokes decreases, finally vanishing 
(€/Umin = 2 in Fig. 21b) as the wheels become solid. Before this hap- 
pens there are certain angles (£/Umin = 0.5, 1.5 in Fig. 21b) at which 
the spaces between the spokes on the two counter-rotating wheels line 
up, offering permitted regions for satellites on the second (right) 8. 


As indicated in this figure, it proves convenient to normalize the 
equatorial spacing ¢ between 8’s to Ynin , the minimum separation or 
closest approach on each 8 (equation 38 or 17, or Fig. 13). Figure 21 
shows that there are a number of actual or potential ‘‘windows’’ for 
interleaving two 8’s, at spacings of approximately ¢/0m;, & 0.5, 1.5, --- . 
It is clear that the width of the corresponding permitted regions in 
Co-space decreases (and eventually disappears) as the order of the 
‘‘window”’ increases, for a fixed K, since the spokes increase in width as 
¢ increases. Stated differently, the first window (corresponding to the 
closest spacing for the two 8’s) has the highest permitted value of K(such 
that the width of the permitted regions in c2-space just approaches zero). 
Since the highest packing improvement factor corresponds to the largest 
K, it is clear that best packing (considering only closest approach) is 
obtained at the first window, that is, with [/vn;, © 0.5 in Fig. 21. It is 
further clear that a larger value of K than indicated in Fig. 21 can be 


used.* 
Let ¢” denote the separation between 8’s at the first window with 


the largest possible value of K, such that the permitted regions in 
Ce-space approach zero width. Then from Figs. 20 and 21 


T or 
: d. = “3? d, = t5n" (49) 
From equation 41 
. i sine sin 
sh. age lre CON 2N 
sin “> = tan 9 5 | (50) 


* Although not shown in Fig. 21, it is clear that for ¢ = 0 there are no permitted 
regions (in cz - space) for K > 1%. This is the degenerate case in which the two 
8’s coincide, so that only single-8 packing is effectively considered. The present 
treatment provides an alternative phase-space derivation to that of Fig. 12 and 
Section III for the optimum single-8 packing results. 
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+ sin = 
K's sngy + aH (51) 
2 sin a 
Noting equation 38, an equivalent form of equation 50 that is some- 
times useful is 


cv 





gin —— sin CLs sisi 
ae sec’ <. 2N aN |. (52) 
in min 2 2sin = 
ea oD N 


Finally, the relative phase between corresponding satellites on the 
two 8’s (equation 48) is 


ce 
on ty 
N is odd, of course, throughout equations 49 through 53 (see equation 
47). We denote the above geometry by the term “close-spaced inter- 
leaved 8’s”. 

Figures 22 and 23 illustrate the spacing between interleaved 8’s 
and the corresponding reduction in minimum separation or closest 
approach (for satellites on different 8’s) for the above conditions. 
Except for the smallest N, the spacing between 8’s 2” is close to 
Umin/2, one half the closest approach for satellites on each individual 
8, as in the first drawing of Fig. 21b. Similarly, the closest approach 
Smin for satellites on different 8’s is only slightly less than Umin except 
for the smallest N.* Similar results may be derived for the higher- 
order windows, but as already mentioned they will have smaller 
values of K, and hence of the packing improvement factor. 

Figure 24 shows the geometric distribution of satellites for two 
close-spaced interleaved 8’s in the region of the equator, at three suc- 
cessive times, for large N. Corresponding satellites on the two inter- 
leaved 8’s have about the same longitude near the equator, neglecting 
the factor proportional to tan? (a/2) in satellite phase on the right 
(dashed) 8. In addition to the two intersections shown near the equa- 
tor in Fig. 24, the two 8’s have two other intersections, near the ends 
of the 8’s, where the satellites must also properly interleave, not 


C= (53) 


* Smin/Umin iS approximately equal to K, as in Fig. 23, to one part in 10-4 for 
the worst case N = 8, and to five parts in 10-6 for N = 7. 
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Fig. 22— Equatorial distance between two close-spaced interleaved 8’s. See 
equations 50 and 52. VN = number of satellites on a single 8. N is odd. 


shown in Fig. 24. Figure 25 illustrates motion over the complete 8’s 
for a moderate value of N, showing the general nature of closest ap- 
proaches near all four intersections. Each satellite has 8 closest ap- 
proaches per day with four different satellites on the other 8 (in ad- 
dition to four closest approaches per day, with two different satellites 
on its own 8). 


We calculate the improvement factor for such interleaved pairs 
(close-spaced—that is, at the first “window,” ¢/min 0.5 for large N) 
spaced equally along the equator. The 2N satellites on each interleaved 
pair must of course be carefully synchronized; no relative synchroniza- 
tion will be assumed between different pairs.* Two such interleaved 
pairs are shown in Fig. 26. 

The following discussion is quite similar to that of Section IV and 

* By synchronizing all of the different pairs they could be placed slightly closer 
together; indeed, the same is true for the (nonoverlapping) 8’s of the packing 
scheme of Section IV. The phase-space picture given earlier in the present section 
can easily deal with this problem; however, the possible increase in improvement 


factor is so slight, particularly for the larger N, that we do not pursue this 
additional complication here. 
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Fig. 16; however, the symbols have slightly different definitions here. 
The closest approach for satellites on different 8’s of an interleaved pair, 
Smin , 18 & little less than v,,;, , the closest approach on each individual 8 
(see equation 39 and Fig. 23). Hence the equatorial satellites are assumed 
to be separated from each other and from the adjacent 8’s by sn:, ; the 
minimum distance between adjacent 8’s, unin , must equal or exceed 
Smia ; and the equatorial system used for comparison in computing the 
improvement factor is assumed to have satellites separated by snin . 
With these changes, and otherwise folowing the discussion of Section 
IV, we have: 


(54) 


min 


RY = ¢ — 2¢’ number of equatorial satellites 
(i) = + 1, between interleaved pairs 


where the { } denote “the largest integer contained in” the enclosed 


1.00 
0.98 
0.96 


0.94 


Smin 
VMIN 


0.92 


= 
~= 


0.90 


0.88 





ie} 3°45 7 9 14 #13 15 17 19 21 23 25 


N-NUMBER OF SATELLITES ON A SINGLE 8 
N ODD 


Fig. 23—Separation reduction factor for two close-spaced interleaved 8’s. See 
equations 51 and 38 through 40. 





Smin 
= K to graphical accuracy 
Umin 
-Smin = Closest approach for satellites on different 8’s 


Umin = closest approach for satellites on the same 8 
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Fig. 24— Central portion of two close-spaced interleaved 8’s. Uniform phase 
spacing over 360° on cach 8. N is odd, N large, a = 28°. Tan? a/2 neglected in 
plotting satellites on right 8. 


expression. 


iz 2N + E = (2N + E)8nin > . 
I(g) = (¢ re ag ae fa | o 5] ¢ = Gain 9 


improvement factor. (55) 
I, = If), I, = I(f2); 
n= Gata ’ 2 = 2¢' + Sain lo Senin) 


I, corresponds to minimum spacing between adjacent interleaved 
pairs, Jz corresponds to just enough extra spacing to accommodate 
exactly one more equatorial satellite than with minimum spacing. 


(56) 
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7 a a vim 
K sin? =-sin = 


, D) 
i eee (57) 
2 a 
COS 5 
- en eS 2 a ° TT 
sin“ = tan 9 E + K sin a (58) 


Smin 18 given by equation 38 and 39 or Fig. 13. In the case of close- 
spaced interleaved pairs of 8’s (that is, using the first ‘“window”), ¢” 
is given by equation 50 or 52 or Fig. 22, and K by equation 51 or 
Fig. 28. 

Figure 27 shows the improvement factor in this case in much the 
same way as Fig. 17 for nonoverlapping 8’s. J; (corresponding to 





63° 82° 


Fig. 25 — Satellite motion for two close-spaced, interleaved 8’s. Phases indicated 
on respective 8’s. N = 5, a = 30°, phase increment = 9°, relative phase between 
two 8’s = 19.03°. 
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Fig. 26— Geometry of adjacent interleaved pairs of 8’s. § = §min for Umin = 
Smin. Smin = closest approach among 2N optimally packed satellites on each 
interleaved pair of 8’s. 


minimum separation between adjacent interleaved pairs) is shown by 
solid lines and Iz by dashed lines in Fig. 27. However, in contrast to 
Fig. 17, only the greater of I, or Iz, corresponding to the largest im- 
provement factor, is shown. As with the case of Section IV, simpler 
analytic forms are readily written for a > 0 and N — oo. We omit the 
former; the latter yields: 


I= 1, = 1, = 2¢——— +1. (59) 
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Equation 59 shows that 2x + 1 is an upper bound on the improve- 
ment factor for the present packing scheme, as compared with an 
upper bound of + + 1 for the scheme of Section IV (compare equa- 
tion 36). Figure 27 shows that this upper bound, like that of Fig. 17 
for the prior scheme, may be approached only for extremely small 
closest approach (very large N); consequently, practical improve- 
ment factors will be somewhat smaller. 

The packing scheme of Fig. 27 uses the first “window” for inter- 
leaving the overlapping 8’s. For N satellites per 8, there are (N—-1) /2 
different windows; expressions analogous to those of equations 49 
through 53 (for the first window) are readily written for the higher 
order windows. There are many different possible packing schemes 


in) 
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Fig. 27—Improvement factor for optimum satellite packing on separated close- 
spaced interleaved pairs of 8’s. Use this figure in conjunction with Figs. 13, 22, and 
23. # = number of satellites on equator between two adjacent interleaved pairs. 


N = number of satellites on a single 8. N is odd. —-——— indicates minimum spacing 
between adjacent interleaved pairs; closest approach for adjacent pairs = closest 
approach for a single pair (J; of equation 56). -—— indicates increased spacing 


between adjacent interleaved pairs that just permits 1 additional equatorial satellite 
(Iz of equation 56). 
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using the higher order windows; while it is clear that these have lower 
improvement factors, some of them may be of interest for other rea- 
sons. The improvement factor of this paper is based entirely on closest 
approach—minimum separation between any pair of satellites. It may 
pay to degrade this improvement factor in order to increase the aver- 
age separation between all pairs of satellites in some particular sys- 
tem, for example. Figure 28 shows one possible geometry that will 
have a somewhat lower improvement factor than the scheme of Fig. 
26. The basic relations of this paper permit the study of any of these 
packing schemes, but we do not pursue this matter further. 

In the above examples (Iigs. 26 and 27, or 28) only two 8’s are effec- 


tively interleaved in any given region; consequently, the fairly simple 
phase-space description of Fig. 21 suffices. We now inquire whether 





Fig. 28 — Geometry of symmetrically interleaved 8’s. Satellites on all 8’s must 
have correct relative phase. 
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Fig. 29— Windows in the ¢ — ¢y plane. N = 17, a = 30°, K = 0.674. K is chosen 
such that the fifth window is just beginning to open. 


Initial Appearance of Window 








¢ 
Window K 

Um in 

1 0.996 0.529 

2 0.962 1.569 

8 0.895 2.556 

4 0.798 3.456 
Present case > 5 0.674 4.239 
6 0.526 4.878 

7 0.361 5.350 

8 0.184 5.640 


greater improvement factors than those of Fig. 27 may be attained by 
using three or more interleaved 8’s. We do not know any general techni- 
que for investigating this problem, and investigation of a few special 
cases chosen at random has not produced any significantly greater 
improvement factor for parameters in the general region of interest 
(N Ss 25). Figure 29 is an alternative picture that casts additional light 
on the general problem of interleaving 8’s, even though its use has not 
yet led to higher improvement factors than those of Fig. 27. 

Figure 29 shows a plot of the “windows,” or permitted regions of 
spacing and relative phase between 8’s, for a given value of K, the 
separation reduction factor (and of course given values N = 17 and 
a = 30°). The assumptions here are the same as before in this Sec- 
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tion, in particular, 360° uniform phase spacing on 8’s of identical size 
(equation 47). Such an 8 may be interleaved with an existing 8 if the 
equatorial spacing ¢ and the relative satellite phase cp (equation 48) 
between the two 8’s are such that the corresponding point lies in a 
shaded region of Fig. 29. For K = 1 there are no windows; as K de- 
creases successive windows open up and become larger. As shown by 
the Fig. 29 table, the first window appears at K = 0.996, the second 
at K = 0.962, ..., until the fifth appears at K = 0.674, correspond- 
ing to Fig. 29; as K decreases further, additional windows open up, 
until all eight are open for K < 0.184. 

For € > €min (equation 58) in Fig. 29 for €/Umin > fmin/Vmin = 6.56, 
a (nonoverlapping) 8 may be placed with arbitrary relative phase co. 
The border of the shaded region at the right edge of this chart is 
slightly scalloped; by choosing co/(r/N) = 0.85 the adjacent 8 may 
be placed a little closer, €/Umin = 6.45, but as mentioned in the foot- 
note on page 2412, the potential improvement is so slight (if not 
nonexistent because it may pay to increase the spacing slightly to ac- 
commodate one more equatorial satellite) that we ignore it throughout. 

The curves of Fig. 29 are readily obtained from equations 41 
through 44 and associated discussion. Four sets of curves must be 
plotted: 


‘ee 
9 d_ 
Std.+z 
CG = +2(p—-1), p=1,2,---N; WN odd. (60) 
Bes 
2 as 
£ a 


This figure is periodic in the vertical direction, with period 27/N for 
Cy, aS &@ consequence of the 360° uniform phase spacing assumed on 
each 8. 

A chart such as Fig. 29 may be used to consider the interleaving of 
a number of 8’s in the following manner. Take a number of identical 
charts, cut out the windows, and cut away the paper to the left of the 
vertical axis. Lay the first chart down on a light-box. Subsequent 
charts are now laid down on top, with corresponding axes parallel, 
in such a way that the origin of each chart always lies on a lighted 
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area. Each chart corresponds to one interleaved 8, so that the total 
number of charts that can be so laid down equals the number of 8’s 
that can be interleaved for the particular value of K (and of N and 
a) chosen. The charts must all be translated (with axes parallel), 
subject to the above constraints, to obtain the maximum total num- 
ber; the vertical periodicity of these charts greatly reduces the num- 
ber of possibilities. 

After this has been done, if more 8’s must be interleaved it will be 
necessary to repeat the process for a smaller value of K. Since the 
resulting increase in the number of satellites per unit longitude is ac- 
companied by a decrease in the closest approach spin (and hence an 
increase in the denominator of equation 20 defining the improvement 
factor), it is not obvious, without carrying out this process, whether 
the improvement factor will be increased or decreased. 

It is clear that the process of the preceding two paragraphs does not 
provide an orderly approach to this problem. A few cases in the region of 
interest, N < 25, have been tried at random without improving the 
packing. We recall that since other geometric properties than simply 
closest approach or minimum satellite separation may be of interest, 
alternative packing schemes with comparable improvement factors may 
be of interest; however, we have not pursued these possibilities. 


VI. ILLUSTRATIVE SYSTEM SERVING NORTH AMERICA 


Figure 30 shows a representative satellite system designed to serve 
the United States and Canada. While this system has not been opti- 
mized, it should serve to illustrate the utility of the above results. 

In addition to the general assumptions of Section I, the following 
general restrictions guided this illustrative design: 


(t) The most efficient packing scheme that we have devised is used, 
close-spaced interleaved pairs of 8’s (Section V, equation 54 through 
59, Figs. 26 and 27). 

(72) Closest approach, or minimum separation between any pair of 
satellites, is about 1°. This would require ground antennas with beam- 
widths of a fraction of a degree. This might be achieved at frequencies 
above 12 GHz, possibly in the millimeter band. Such a proposal is 
somewhat speculative because of the lack of sufficient propagation 
data. 

(227) Minimum elevation above the horizon = 6.4°. 

(tv) Space in orbit is roughly allocated to what seems the best use. 


2422 THE BELL SYSTEM TECHNICAL JOURNAL, DECEMBER 1968 


h 
cn 
°o 


165°, 150° 135° 120° 105° 90° 75° 60° 
7 ~ ANCHORAGE we 
Balas 
































HONOLULU " wh | Nendo ace ' 
renin NAAR RAREST 
15 Ue ae AVIVA TN Nom * : 
I | DL antl 

——— ee 
aaa eee ee ene aa 




















(a) = 
co} Gy 
LL 
t 
I 
I 
AB RIS 
EQUATORIAL 
SATELLITES 


Fig. 30—Satellite system for North American continent. Total satellites = 
477; total in equatorial system occupying same longitude = 95. Packing scheme: 
close-spaced interleaved pairs of 8’s, at minimum spacing ({ = {min in Fig. 26). 
Design parameters: a = 25°, orbit inclination. Umin = 0.986° , Smin = 0.982°, closest 
approach. N = 17, number of satellites on a single 8 (individual satellites not in- 
dicated). E =5, number of equatorial satellites between adjacent pairs of 8’s. 
= 5.34, improvement factor (Fig. 27). Minimum elevation = 6.4°. 8’s and equa- 
torial satellites that can serve representative paths without allowing elevation to 
decrease below 6.4° shown by horizontal arrows, determined from Fig. 32. 


Thus, space suitable for trans-Atlantic or trans-Pacific communica- 
tion has generally not been used. Satellites have been placed farther 
from the United States coast over the Pacific than over the Atlantic 
because the Pacific is wider. 

(v) 8’s have been omitted where they would not appear needed 
(over Hawaii), on the basis of a crude estimate about relative traffic 
density. 

(vi) In such a system each satellite might serve every ground sta- 
tion that it can see, within limits imposed by the resolution of the 
satellite antennas. Referring to Fig. 2, for the minimum elevation 
chosen for this example (6.4°), an orbit inclination «# = 30° allows 
no tolerance at all in longitude for a ground station at 45° north lati- 
tude, somewhat south of the western United States-Canada border. 
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Consequently, a somewhat smaller inclination (@ = 25°) has been 
chosen in Fig. 30. 
Figure 31 shows the region of visibility of a satellite from ground 


stations at several different north latitudes to the same scale as Fig. 
30. Equatorial satellites and satellites at 25° south latitude, of inter- 
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Fig. 31— Region of visibility of a satellite from ground stations at representa- 
tive north latitudes, assuming minimum elevation of 6.4° above the horizon. Same 
scale as Fig. 30, so that these curves give directly permitted 8’s and equatorial 
satellites for representative paths on Fig. 30. 
est in Fig. 30, are shown in more detail on Fig. 32.* Using these data, 
the ranges of 8’s and of equatorial satellites that can serve eight 
representative paths have been shown by the horizontal arrows on 
Fig. 30. 

Two general conclusions are readily apparent: 


(2) The equatorial satellites are more versatile than the 8’s satel- 


* The derivation of these results, which is elementary, is given in Appendix B 
for convenience. 
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_ Fig. 32— Maximum relative longitude from ground station for which satellite 
is visible 6.4° above the horizon. 


lites; only the former can serve paths north of the United States- 
Canada border. 

(12) The most versatile of the 8’s satellites lie over the central 
United States; the further from the center, the smaller the variety of 
paths served. 


Assuming that each satellite carries transponders to serve all ground 
stations that it can see, the equatorial satellites will be the heaviest, 
the central 8’s satellites next heaviest, and the edge 8’s satellites the 
lightest. 

It is obvious that the more easterly satellites of Fig. 30 can also 
serve Central and South America (with additional transponders for 
this purpose). 

We re-emphasize that this proposed system is only for illustration. 
Much detailed study would be required to choose the best parameters, 
even within the assumptions imposed here. 
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VII. DISCUSSION 


The best packing scheme in this paper yields an improvement fac- 
tor of roughly six over a purely equatorial system in the range of 
probable interest (see Fig. 27). This result is based on a great many 
assumptions, already stated. We believe that the relaxation of all of 
these assumptions should be investigated further. While we have 
tried to make reasonable choices, we have no assurance either that the 
improvement factors of Fig. 27 are the best that can be obtained, or 
that further work will yield better packing schemes (or alternative 
packing schemes of interest for other reasons). 

The treatments of optimum packing a single 8 (Section III) and 
separated 8’s (Section IV) are virtually complete.* However, the 
treatment of overlapping 8’s in Section V, which yields the largest 
improvement factor, is far from complete. We do not know how to 
approach this latter problem in a general way. As a start, it seems 
possible that extending the present treatment of Section V to 8’s of 
moderately different sizes (a’s) might allow the interleaving of more 
than two 8’s with significantly greater improvement factors. 

All of the discussion has been based on the assumption that each 
satellite remains in view of every ground station that it serves and no 
switching is required. This represents one extreme; the other extreme 
is a low-altitude system, with frequency switching. Intermediate sys- 
tems are also possible, with switching several times a day. If the as- 
sumption is relaxed to allow this, other quite different systems become 
possible. Two considerations are: 


(i) Larger 8’s (with a larger inclination «) may be used; whether 
this would lead to any advantage for the United States remains to be 
studied. However, Europe is enough farther north than the United 
States that figure 8 systems may not be useful unless occasional 
switching is allowed. 

(17) The antenna scanning and beam shaping problems may be 
eased. For example with 360° packing on each individual 8, as as- 
sumed in the interleaved packing schemes of Section V, each beam 
would need to scan a much smaller angle. 


Once occasional switching is allowed, lower-than-synchronous orbits 
may be considered. All of the packing schemes discussed above still 
apply, being based only on circular orbits and not on how fast the 


* Not all items of interest have been investigated, but we believe the basic 
relations given here are sufficient for most purposes. 
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earth happens to be rotating. However, now the satellite would have 
to extend all the way around the earth, rather than say one quarter of 
the way around as in Fig. 30 for synchronous orbits. Further, the 
mutual visibility problem becomes worse as the altitude decreases; 
additional study would be required to determine the resulting im- 
provement factors. 

In Sections IV and V different 8’s and different interleaved pairs of 
8’s, respectively, were allowed to have arbitrary relative phase. We 
now inquire what use can be made of this additional parameter. As 
already mentioned, slightly closer packing and hence slightly greater 
improvement factors can be obtained in some cases by the correct 
choice of relative phase, although this increase is small and will 
yield only a few additional satellites in the illustrative system of 
Fig. 30. 

A more interesting possibility is to use this parameter to reduce the 
number of different orbits required, so that one vehicle can launch 
many different satellites lying on the same orbit but on different 8’s. 
From the basic definitions of Fig. 4, it is obvious that two satellites 
lying on the same orbit but on different 8’s of identical size (a), 
spaced along the equator by Z, have a relative phase of ¢. The 2 x 12 
X 17 = 408 8’s satellites of Fig. 30 generally require 408 orbits. By 
proper choice of relative phase (¢ + €” = phase difference between 
adjacent pairs—see Fig. 26), only 34 different orbits are required; 
thus only 34 launch vehicles, each carrying 12 satellites, need be used. 

A study of the fuel requirements to maintain desired tolerances in 
satellite positions for the different systems is needed. It is not obvious 
whether this requirement is greatly different for the different systems. 
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APPENDIX A 


Derivation of Fundamental Results of Section II 


Consider on Fig. 4 the isosceles (spherical) triangle composed of the 
equator, the orbit, and the great circle connecting the satellite to the 
earth reference point (at the intersection of the reference longitude and 
the equator). This triangle has two equal sides of (great circle) length 
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c (measured by the angle subtended at the center of the sphere) with 
included angle a, and a third side of length a; let the remaining two 
equal angles be denoted by y (not indicated on Fig. 4). Noting that 
y + « = 7/2, the law of sines yields 


sin@ _ COS € (61) 
sna since 
Now bisect the above triangle into two equal right triangles by bisect- 
ing the angle « (with a great circle). From the law of sines 





sin = 
2 1 

i a ~ gine’ (62) 
2 


which yields equation 1. We now use equation 1 to eliminate the 
parameter a from equations 61 by means of the double-angle formula, 
yielding 


1 — sin’ 5 
(08.6. SS) Se (63) 
1 — sin’ 5 sin’ c 
Equation 2 now follows directly. 

Equation 3 follows from the law of sines applied to the right tri- 
angle of Fig. 4 whose sides are the orbit, the equator, and the dotted 
longitude passing through the satellite. The law of cosines applied 
to this triangle yields 


cos c = cos l-cos e. (64) 
This becomes 
sin’ c — sin’ \ 
a ( 1—sin?l /’ (65) 


substituting equation 3 into equation 65 yields equation 5. 

Consider next the right triangle formed by the equator, the dotted 
longitude, and the great circle joining the satellite to the earth ref- 
erence point (defined in the first sentence of this appendix) in either 
Fig. 4 or 6. The law of cosines yields 


cos a = cos l-cos y. (66) 
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Thus 


sin? a — sin’ 1\} 
siny = (HSE). 7 
Using the double-angle formula, substituting equations 1 and 3 into 
equation 67 yields equation 4. 

Consider now the right triangle of Fig. 6 formed by the reference 
longitude, the great circle connecting the satellite to the earth ref- 
erence point (see above), and the great circle of length x passing 
through the satellite and normal to the reference longitude. From the 
law of sines 


sin x = sin a-sin e. (68) 


Using equations 1 and 2 to eliminate a and e«, together with the double. 
angle formula, equation 7 is readily obtained. 

We next derive the general result of equation 11, obtaining all the 
remaining results of Section II by specializing this relation. Applying 
the law of cosines to the spherical triangle of Fig. 9 whose vertices 
are the north pole and the satellites on each of the two 8’s, 


cos s = sin /,-sin 7, + cos l,:cos l,-cos (¢ + Wi — Yo). (69) 
Make the following substitutions in equation 69: 


(2) equation 3. 
(wz) Expand cos (€+W1—y2). 
(wz) From equations 66 and 1, and the double-angle formula, 


° a. 
1 — 2-sin? 5 "sin" c 


cos l-cos y 


Qa ° a 
cos” 5 +- sin’ 5° COS 2¢ 


(tv) From equations 3 and 4 


cos I-sin Y = sin’ 5 sin 2c. 
After lengthy but straightforward transformations equation 11 is 
obtained. We readily obtain equation 8 (see Fig. 6) by the second 
transformation in the table following equation 11. For equation 10, 
the fourth transformation of this table yields 
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+2 
cos U = COS f-cos' 5 + ee [cos (2c. — £) — cos 2c] 


sin? a 
2 


Make the following substitutions in equation 70: 





+ + sin* = cos (4c. — &). (70) 


(2) cos (2c. — £) — cos 2c. = 2 sin (26. — £) sin é. 





(12) cos o = 1 — 2sin’ S ; oc =U, £, 4, — §&. 
+2 
see sim a . 2a 2a 
(222) oS 2 sin 9° C08 5 
We have after combining terms 
2 
sin’ 5 = | cos “sin § — sin’ 5 sin (26, = s)] ; (71) 


simply the square of equation 10. 

To derive equation 18, substitute equation 12 into equation 11 and 
use the double-angle formula on the a-dependent factors of the third 
and fourth terms to yield, after some minor rearrangement, 


a sin’ a 


9 00S (24-5) + 





a . 
cos s = cos ¢- cos" 5 + gin* “COS Co 


s 2 
+ =" [eos (2c, +) —¢) + & + cos (2c; + & + & — §) 
— 2 cos (2c, + c)]. (72) 
The three terms inside [ ] may be regarded as an AM wave and so 
are readily combined, to yield 
sin” a 


2 


a a 


cos $ = cos’ 57 COS ¢ + sin’ 9° 008 (265.0) 





COS Co 


: 5, Ch 
— gin? a:-sin’ ( of 





) cos (2c, + e). (73) 
Make the following substitutions in equation 73: 


. . Co 
(2) cosg = 1 — 2sin’ 5 ; g = 8, f, 2) — §,%. 
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(it) cos (2c, + ¢)) = 2 cos” (c, + ) — 1. 





2 
: 2 
es sin @ gy 2h 2 aw 
(272) 5S 2 sin 9° 008 5 
Equation 13 is obtained directly. Equation 9 follows immediately, as 
pointed out in the text. 


There are of course a great many alternate ways of deriving these 
various results, 


APPENDIX B 


Visibility of a Satellite from a Ground Station 


Figure 33 shows the earth, of unit radius, concentric with a sphere 
of radius R, equal to the radius of a circular satellite orbit. For syn- 


SYNCHRONOUS ORBIT:R= 6.624 


~ 


A, 
/ Og 9 





Fig. 33 — Region of visibility of a satellite. Synchronous orbit: R = 6,624. 
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chronous orbits R = 6.624. The elevation of a satellite viewed from a 
ground station must exceed the angle w; the satellite is restricted to a 
region of angle from the ground station (measured at the center of 
the spheres) less than a. Projecting the satellite on a sphere (the 
earth in Fig. 33), the satellite is restricted to a region within a circle 
centered on the ground station (shown dashed in Fig. 33) of (great 
circle) radius a, with the usual convention of spherical trigonometry 
that great circle distances are measured by the angle they subtend at 
the center of the sphere. From Fig. 33, 


_ R-cosa — 1 


csc a 
tanw = Rage cot a — ae (74) 
Solving for a, 
; COS w i sin 2w 
sin a = Cos *E — ( R ) ary (75) 


At synchronous radius, 
sin a = cosw(1 — .0228 cos’ w)? — 0.0755 sin 2w, 
R = 6.624. (76) 


The relation between latitude J! and relative longitude y on the 
dotted circle is found from Fig. 34. The ground station is located at 
north latitude ly. From the law of sines, 


cos J-sin y = sin a-sin e. (77) 
From the law of cosines 
sin 1 = cos a:sin 1) + sin a-cos I)-cos e. (78) 


Substituting equation 76 into equations 77 and 78, the results of Figs. 
31 and 32 are readily computed parametrically in terms of the 
azimuth e. 


APPENDIX C 


Error in Satellite Separation 


The error in separation between satellites resulting from displace- 
ment from the center of the earth is readily calculated. Fig. 35 shows 
the earth and a pair of satellites in circular orbits of radius R, sepa- 
rated at this instant by angle s subtended at the center of the earth. 
The side view shows a ground station; the plane of the paper includes 
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S) 


Fig. 34 — Latitude and longitude of region of visibility. 


the center of the earth, the ground station, and the upper satellite. 
The poles of the earth are not necessarily vertical in this figure. The 
lower satellite does not in general lie in the plane of the paper; the 
end view shows a line joining the two satellites, making an angle 7 
with the plane of the upper satellite, ground station, and center of 
the earth. 

In the case of interest here the satellites lie close together. Con- 


PLANE OF SIDE VIEW, 
SYNCHRONOUS ORBIT:R=6,624 I 


| | GROUND 
STATION. 


_—— 
~_~ 






z ~_LRscost | 4 Rs-cost 
IPE ai 


~S GROUND STATION 


Rs+Cost+-Cospg 


SIDE VIEW END VIEW 


Fig. 35—Satellite separation as a function of ground station location. s = 
angular separation between satellites at center of earth. Distance between satel- 
lites (Rs) and its projections are true (not angular) lengths, 
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sequently we assume throughout this appendix that 


s<l. (79) 


The ground station is restricted to the region of the earth where the 
satellite remains visible, the limiting case corresponding to zero eleva- 
tion.* Consequently the angle 8 of Fig. 35 is bounded by 


B< Bmex ; SIN Bmax = 1/R. (80) 
The distance R’ from the ground station to the satellite has maximum 
and minimum values corresponding to maximum and minimum values 
of B. Thus 
1 


R’ 1\3 
1 — bcm < 005 Bau = (1 — 4) (81) 


We seek the angular separation between the two satellites as seen 
from the ground station; let this quantity be denoted by s’. We pro- 
ject the line segment joining the two satellites onto the plane perpen- 
dicular to the line R’ joining the satellites and the ground station as 
shown. Then 


s'/s = (R/R’)[1 — (sin B-cos 7)*]. (82) 
We have 


L<si/s< peu 


1° (83) 


For synchronous orbit 


1 <s'/s < 1.178. (84) 


Thus as stated in Section I, for synchronous orbit the approximation 
of this paper which determines separation between satellites as seen 
from the center of the earth (rather than from ground stations) ranges 
from correct to 18 per cent too conservative for the critical cases of 
closest approach. 
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